Category: big data

Releasing 1.8 million open access publications from publisher systems for text and data mining

Text and data mining offers an opportunity to improve the way we access and analyse the outputs of academic research. But the technical infrastructure of the current scholarly communication system is not yet ready to support TDM to its full potential, even for open access outputs. To address this problem, Petr Knoth, Nancy Pontika and Lucas Anastasiou have developed the […]

What factors do scientists perceive as promoting or hindering scientific data reuse?

Increased calls for data sharing have formed part of many governments’ agendas to boost innovation and scientific development. Data openness for reuse also resonates with the recognised need for more transparent, reproducible science. But what are scientists’ perceptions about data reuse? Renata Gonçalves Curty, Kevin Crowston, Alison Specht, Bruce W. Grant and Elizabeth D. Dalton make use of existing survey […]

Without urgent action big and open data may widen existing inequalities and social divides

The juncture of big and open data informs areas as diverse as artificial intelligence, agriculture, and public health, and promises to transform our ability to tackle global challenges. However, Sabina Leonelli highlights three major concerns over how big and open data are currently managed: the unsustainable nature of the digital data landscape; the quality and credibility of the data themselves; […]

An emerging iron cage? Understanding the risks of increased use of big data applications in social policy

Big data technologies are increasingly being utilised in the field of social policy. Although big data methods and strategies are often preferred as a form of evidence-based policy development, big data techniques do not necessarily guarantee scientific objectivity. Hamish Robertson and Joanne Travaglia discuss concerns about the rapid growth in big data methods being used to inform and shape social […]

More data or better data? Using statistical decision theory to guide data collection

When designing data collection, researchers must take important decisions on how much data to collect and what resources to devote to enhancing the quality of the collected data. But the threshold for choosing better over bigger data may be reached long before the sample numbers in the thousands, write Jeff Dominitz and Charles F. Manski. Big data has become an […]

Book Review: Open Data and the Knowledge Society by Bridgette Wessels, Kush Wadhwa, Rachel L. Finn and Thordis Sveinsdottir

In Open Data and the Knowledge Society, authors Bridgette Wessels, Kush Wadhwa, Rachel L. Finn and Thordis Sveinsdottir place the management of open data ecosystems at the heart of the transformation into a “knowledge society”, presenting five case studies through which to consider various ways of dealing with different types of data. Miranda Nell welcomes this book for showing how open data is […]

Collaboration and concerted action are key to making open data a reality

The case for open data is increasingly inarguable. Improved data practice can help to address concerns about reproducibility and research integrity, reducing fraud and improving patient outcomes, for example. Research also shows good data practice can lead to improved productivity and increased citations. However, as Grace Baynes reports, recent survey data shows that while the research community recognises the value […]

Book Review: We Are Data: Algorithms and the Making of Our Digital Selves by John Cheney-Lippold

In We Are Data: Algorithms and the Making of Our Digital Selves, John Cheney-Lippold examines how algorithms increasingly interpret and influence our behaviour. With the author concluding with some pragmatic suggestions for challenging the digital status quo, Daniel Zwi welcomes the book for both capably elucidating the problem of algorithmic regulation and forearming us to tackle this issue. This review originally appeared on LSE Review of […]

Journal policies that encourage data sharing prove extremely effective

There is currently little incentive for researchers to share their data. But what if it was enough for journals to simply ask authors to make their data available? Michèle B. Nuijten reports on a recent study that found journal policies that encourage data sharing to be extremely effective, with a steep increase in the percentage of articles with open data […]

Starter tips on sharing data and analysis scripts

Researchers are increasingly encouraged to make their data openly accessible and usable for others. To early-career researchers in particular, this can seem daunting, with different considerations when posting data publicly rather than retaining it solely for internal use. Katherine Wood has compiled a short open data starter guide to make the process less overwhelming and help researchers do their bit for […]