Clean up raw CDI output (item + summary) for any project administered through WebCDI.
wrangle_web_cdi.RdClean up raw CDI output (item + summary) for any project administered through WebCDI.
Arguments
- form
Which kind of cdi form is this (
WGorWS). Will only include columns from each form.- table
Which subset of the output to include? All table include id-related columns: "study_name", "subject_id", "repeat_num".
"summary" (default): each row is one cdi form, including aggregated score for each form (e.g. total words produced, total number of first gestures, etc.).
"wordlevel": each row is an item on one cdi form, with an "item" column for the name of the item and a "response" column.
"raw": this is the rawest form of the data as downloaded from webcdi, with id-related columns added, each row is one cdi form, including columns for each item and columns for aggregated score, as well as metadata columns (e.g. date created, webcdi link, admin id, etc.). Is not affected by
withDemographicandjustWordparameters.
- withDemographic
Should the data include answers to demographic questions (e.g. sibling count, birth order, caregive info, etc.)? These columns are not treated as item, so if you select
wordlevel_longtable, the demographic answers will be included in every row. If you selectrawtable, demographic columns will always be included.FALSEis default.- justWord
Should the data include only vocabulary checklist item? If
TRUE(default), returns onlyWords ProducedandWords Understoodrelated columns forsummarytable and vocabulary items forwordleveltable. IfFALSE, will include gestures-related items forWGform and sentence-related items forWSform. If you selectrawtable, all items will be included regardless of this variable.- rename
For
WSform, do we standardize the name of the word-related summary item to be the same asWGform. IfTRUE(default), the three summary columns in WS namedTotal Produced,Total Produced Percentile-sex, andTotal Produced Percentile-bothwill changed toWords Produced,Words Produced Percentile-sexandWords Produced Percentile-both. Otherwise, they will remain unchanged- filepath
Dataframe of the csv file downloaded from WebCDI (you should use the csv with both item level and sumary data)