Category: data quality

More data or better data? Using statistical decision theory to guide data collection

When designing data collection, researchers must take important decisions on how much data to collect and what resources to devote to enhancing the quality of the collected data. But the threshold for choosing better over bigger data may be reached long before the sample numbers in the thousands, write Jeff Dominitz and Charles F. Manski. Big data has become an […]

Excel is threatening the quality of research data — Data Packages are here to help

This week the Frictionless Data team at Open Knowledge International will be speaking about making research data quality visible at the International Digital Curation Conference (#idcc17). Dan Fowler looks at why the popular file format Excel is problematic for research and what steps can be taken to ensure data quality is maintained throughout the research process. Our Frictionless Data project aims […]