This project has always had to deal with uncertainty given that we are collecting data in real time and we thus cannot know ahead of time:
Having project-wide goals helps us coordinate and make sure that despite the uncertainty, we are doing our best to:
Having complete and clean data for many countries over the same time period is crucial to say anything about what the drivers and effects of COVID-19 government policies are
The overall project goals are to:
The time period for which we aim to collect complete and clean data will be different depending on whether a country is a spotlight country or a capsule country and whether subantional data collection is involved:
Spotlight national and subnational countries: Document policies made up until 10/2021 (hard goal)
Capsule National Countries: Document policies made up until 03/2021 (hard goal) + 10/2021 (soft goal)
Capsule Subnational Countries: Document policies made up until 10/2020 (hard goal) + 10/2021 (soft goal)
What is ‘complete’ data? | What is ‘clean’ data? |
---|---|
We can only judge the ‘completeness’ of the data for any given region using:
|
We can judge the ‘cleanliness’ of the data for any given region by
|
I am coding for a Spotlight national or subnational country
Your RM/CM will have quarterly-specific goals designed to help reach the overall project goals which takes into account what the data looks like for your country or subnational region.
In general however, all spotlight national or subnational countries should work in coordination as follows:
For each stage, you should work on making sure the data in your region is complete and clean before moving on to the next stage.
I am coding for a Capsule National Country
Your RM/CM will have quarterly-specific goals designed to help reach the overall project goals which takes into account what the data looks like for your country.
In general however, all capsule national countries should work in coordination as follows:
For each stage, you should work on making sure the data in your region is complete and clean before moving on to the next stage.
I am coding for a Capsule Subnational Country
Your RM/CM will have quarterly-specific goals designed to help reach the overall project goals which takes into account what the data looks like for your country.
In general however, all capsule subnational countries should work in coordination as follows:
If your subnational region has reached ‘Stage 1’ you should not code any further for your subnational region— please coordinate with your country manager to help get data for other subnational regions in your country complete and clean at Stage 1
I am a regional or country manager
In general, the workflow for achieving these goals will look like:
For Fall 2021 we will aim to stick to the following timeline:
To access the some of the elements listed in the workflow Chart, please see the table below:
How to get to complete data? | How to get to clean data? | |
---|---|---|
Processes | Quarterly Survey | |
Goal Making | ||
For RMs/CMs: Region/country specific completeness goals (e.g. number of policies to integrate, checking government sources or sources on Overton/Jataware until a certain date) | For RMs/CMs: Region/country specific cleaning goals (e.g. policy types to check for cleanliness, fixing X problems through automated assessment of data problems) | |
Assessment of Current State of the data | ||
For RAs: Internal assessment of e.g. completeness, complexity of the policy-making process via the RA Internal Survey (link for August 2021 survey here) | For RAs: internal assessment of the quality of the data via the RA Internal Survey (link for August 2021 survey here) | |
Monthly RM/CM Feedback on the region and progress towards goals (ideally should take 15-20 minutes to fill out ) | ||
For RMs/CMs: Update on whether quarterly regional/country goals are on track | For RMs/CMs: Feedback on problems/developments in the region | |
Tools | Data Overviews | |
CoronaNet Tableau Visualization Overview : - Overview of number of policies currently coded in CoronaNet per country/province and policy type over time in both visual and table formats - Overview of number of policies to integrate from other datasets per country/province and policy type over time |
Automated Data Quality Checks - Identification of which policies need to be cleaned according to the automated assessment - Identification which policies have the 'wrong' policy type according to data science models in both visual and table formats (along with update and correction links; forthcoming) |
|
Data Integration | CornEdit App [link TBD] | |
Data Integration Sheets - Data from external data which allows RAs to i) assess the overlap between external data and CoronaNet data ii) integrate/recode data into CoronaNet taxonomy |
Easy to use tool for making corrections to common mistakes | |
Shiny App | ||
Use the Shiny App for: - Visualization of a timeline of policies - Access to table format of policies - Update and correction links for policies in Qualtrics |
||
Information Resources | ||
CoronaNet Dashboard : Main portal which has detailed information on all resources and up to date information on changes in the project | ||
Slack : Main communication platform for the project, interact with other project members here! | ||
Overton Raw Sources /Jataware/Starsift Raw Sources: Access to potential raw sources about government COVID-19 actions | CoronaNet Previously Uploaded PDFs : PDFs of previously coded policies | |
Low State Capacity Guidelines: Guidelines for how to document policies for countries with low state capacity | CoronaNet Coding Guide | |
CoronaNet RA Previous Materials : Access to materials and information generated by RAs | ||
CoronaNet Skeleton : Details and examples of how the data should in theory be structured | ||
CoronaNet Survey : R markdown version of the Qualtrics survey | ||
CoronaNet PDF Codebook : Detailed information on each survey question | ||
CoronaNet Condensed Taxonomy : Detailed information how a subset of variables related to each other | ||
CoronaNet Duplicate Detector : Helps you assess whether a policy you are thinking of coding is already in the dataset or not | ||
CoronaNet Policy Predictor : Helps predict the best policy type for coding a given policy | ||
RA FAQs [Link TBD] : Summary of commonly asked questions asked in ra-chat |
How should I prioritize coding different policy types under this overall strategy?
As before, wherever possible, RAs should still prioritize coding the following policy types first wherever possible. Having the same priorities for policy types also helps ensure consistency and completeness along this dimension of the data, which improves cross-regional analysis of the data for these policy types.
Group 1
Group 2:
Group 3:
Group 4
Group 5:
Group 6:
Group 7:
What do we do with the data checklists?
We will be relying on the RA Internal Survey for much of the information that we used to get from the data checklists. However, if you’ve found them to be useful tools for yourself, we’ll still keep them around and you’re free to use them if you find them helpful!