CEP & CSV before starting
Dear OFT,
I am preparing a survey, and I would appreciate your responses and advices on the below points, which I need to clarify before I start the implementation of the project with Collect Earth. Just to note, that the sampling design (number, and position/coordinates of the plots) has been defined and fixed:
1. Is it possible to add further information to the plots after the end of the “interpretation phase” (i.e. the phase where the users fill in the question forms) in order to be able to perform analysis (e.g. Saiku) with all of it at the end? Meaning that the project can be initiated and completed having a CSV file with only the basic columns as specified by the CSV grid template (id, Ycoord, Xcoord, elevation, aspect, slope) and after its completion to add/assign ***other*** (e.g. vegetation zones, soil zones, climate zones, etc) spatially available information (info available from a shp per plot) to the plots (based on previous responses this will require the change of schema design with adding the respective attributes and the change of the CSV file with adding respective columns)?
2. Should I need to know in advance (i.e. before the initiation of the project) the information to be added later, at least as a “title”, in order to add in the CSV file the headings for this additional information and in the schema the attributes (and keep the records in the CSV file blanks), or I only need to have a CSV file (with columns) and attributes in the schema design, only for the already available information at the time the project begins (i.e. slope, aspect, elevation, coordinates)?
3. Should the region_areas.csv and/or areas_per_attribute.csv files be contained in the CEP file from the beginning of the project, or can be added later (e.g. at the end), since from what I understand, are being used for the final analysis?
4. For the attributes labeled as “coming from CSV”:
i. In the survey design, should I always mark the “only when expression is verified” using the “false()” function and having them as “Not required” and “hide when relevant”?
ii. What is the purpose of having a code attribute with a code list, since the values of the attributes are coming from the csv (i.e. already spatially available info)?
5. Is there a way to customize the plot size to other than the default given values (e.g. to 1.5 ha)?
6. The sampling design resulted to more than 10,000 plots. I have already a CSV file with information on id (text attribute such as XXX_1, XXX_2), Ycoord , Xcoord, slope, elevation, and aspect, all of them per plot. The users that will be involved are more than five (not in a network). I would like to provide them with a specific amount of plots to work on, with, each time (e.g. 500 plots) avoiding overlapping:
iii. Could you advice me on how to proceed with that split of the plots? Am I going to split the CSV file, to a number of CSV files containing e.g. 500 plots each, and providing the users with one of those CSV files each time? Or is there a more efficient and more secure way, in order at the end the incorporation into one to be easier? Am I going to provide the users with a different CEP file, every time the CSV file changes?
Just to note that I tried to import my CSV file in the survey from the survey designer (survey tab, in the Grid type) and I got the message “The number of plots (lines) in the CSV file should not be larger than 3997.”
Thanks a lot, and apologies for this extended long list of questions.
Regards,
itzam