tidy data for social science surveys #119

jeanetteclark · 2022-03-09T19:36:53Z

Need to write a lesson on how to create tidy data structures out of survey data

key points to hit:

entities, observations, variables (survey population, individual, question response)
consistent coding of variables
open formats

our example dataset might feature:

@mbjones would like your input here

jeanetteclark · 2022-03-09T19:53:45Z

we could also use two tables to show how to separate sensitive and non-sensitive data with participantId as a key

jeanetteclark · 2022-03-10T20:08:01Z

Data modeling lesson

10 simple rules
entities, observations, variables
joins and entity relationship diagrams between two tables of fake data(perhaps survey answers and community characteristics, with community being the join variable)
group exercise, design tables based on a survey instrument

Data tidying lesson

cleaning up text based issues (stringr) in fake dataset created for first lesson
miscapitalization, stray whitespaces
joins
group_by, summarize, filter
separate?

Text analysis lesson

jeanetteclark added the lesson plans label Mar 9, 2022

Provide feedback