tidy data for social science surveys #119

Open
jeanetteclark opened this issue Mar 9, 2022 · 2 comments

@jeanetteclark
Collaborator

We need to write a lesson on how to create tidy data structures from survey data.

key points to hit:

  • entities, observations, variables (survey population, individual, question response)
  • consistent coding of variables
  • open formats

our example dataset might feature:

  • Excel workbook with multiple sheets (tabs)?
  • inconsistently coded variables
  • other?

@mbjones would like your input here

@jeanetteclark
Collaborator Author

We could also use two tables to show how to separate sensitive and non-sensitive data, with participantId as the key linking them.
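
A rough sketch of what that split might look like in R with dplyr. Everything except participantId is invented for illustration (the table names, the columns, the values), so treat it as one possible layout rather than the lesson's actual dataset:

```r
library(dplyr)

# Fake survey export; all columns other than participantId are hypothetical
survey_raw <- tibble(
  participantId = c("p01", "p02", "p03"),
  name          = c("Ana", "Ben", "Cy"),
  email         = c("ana@example.org", "ben@example.org", "cy@example.org"),
  community     = c("North", "South", "North"),
  q1_response   = c(4, 2, 5)
)

# Sensitive table: identifying fields only, kept under restricted access
participants_sensitive <- survey_raw %>%
  select(participantId, name, email)

# Non-sensitive table: responses only, suitable for sharing
responses_public <- survey_raw %>%
  select(participantId, community, q1_response)

# Someone with access to both tables can recombine them on the key
rejoined <- inner_join(participants_sensitive, responses_public,
                       by = "participantId")
```

The point of the design is that responses_public can be published on its own, while the key still allows the two tables to be linked when needed.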

@jeanetteclark
Collaborator Author

Data modeling lesson

  • 10 simple rules
  • entities, observations, variables
  • joins and entity relationship diagrams between two tables of fake data (perhaps survey answers and community characteristics, with community as the join variable)
  • group exercise, design tables based on a survey instrument
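
To make the join idea concrete, here is a minimal sketch assuming two fake tables along the lines described above. The table names, columns, and values are placeholders made up for this example, not a proposed dataset:

```r
library(dplyr)

# One row per individual response (hypothetical columns)
survey_answers <- tibble(
  participantId = c("p01", "p02", "p03", "p04"),
  community     = c("North", "South", "North", "East"),
  q1_response   = c(4, 2, 5, 3)
)

# One row per community (hypothetical columns)
community_characteristics <- tibble(
  community  = c("North", "South", "East"),
  population = c(1200, 450, 3100),
  region     = c("coastal", "inland", "inland")
)

# Many-to-one relationship: many answers per community,
# exactly one characteristics row per community
answers_with_context <- left_join(survey_answers, community_characteristics,
                                  by = "community")
```

A left join keeps every survey answer even if a community is missing from the characteristics table, which would show up as NA values and could be a useful discussion point in the lesson.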

Data tidying lesson

  • cleaning up text-based issues (stringr) in the fake dataset created for the first lesson
  • miscapitalization, stray whitespace
  • joins
  • group_by, summarize, filter
  • separate?
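
The tidying steps listed above could be sketched roughly as follows. The messy table is invented for illustration (including the combined age_gender column used to motivate separate), and this is only one way the lesson might sequence the functions:

```r
library(dplyr)
library(stringr)
library(tidyr)

# Hypothetical messy responses: stray whitespace, mixed capitalization,
# and two variables packed into one column
messy <- tibble(
  participantId = c("p01", "p02", "p03", "p04"),
  community     = c(" North", "north ", "SOUTH", "South"),
  age_gender    = c("34_F", "51_M", "28_F", "45_M"),
  q1_response   = c(4, 2, 5, 3)
)

tidy <- messy %>%
  # Trim whitespace, then normalize capitalization so codes are consistent
  mutate(community = str_to_title(str_trim(community))) %>%
  # Split the packed column into one variable per column
  separate(age_gender, into = c("age", "gender"), sep = "_", convert = TRUE)

# Summarize responses per community once the coding is consistent
summary_by_community <- tidy %>%
  filter(!is.na(q1_response)) %>%
  group_by(community) %>%
  summarize(n = n(), mean_q1 = mean(q1_response))
```

Without the stringr step, group_by would treat " North", "north ", "SOUTH", and "South" as four separate communities, which is a simple way to show why consistent coding matters.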

Text analysis lesson

  • analyzing the fake dataset free text responses
