Category: DMPonline

RDA 11th Plenary Berlin: Active DMP Excitement

Now that the dust has settled, here are a few thoughts and reflections from March’s 11th RDA Plenary in Berlin, with a focus on Data Management Plans (DMPs).

DMP Common Standards WG

My participation in this plenary began with the DMP Common Standards WG meeting tasked with finding practical ways to make DMPs machine-actionable. We were pleasantly surprised by the number of participants, most of who perhaps aren’t interested in the technical minutiae of the WG conference calls, but are nevertheless interested in the general discussion and wish to be included in shaping the outputs and recommendations of this group.* [Theme 1]

More importantly, many participants had questions to ask and insights to offer during the Towards a Common Data Model moderated discussion to the extent that, unavoidably, many conversations spilled over into the coffee break and the immaculate corridors of Berlin’s Congress Center! This happily points to a clear need to listen and gather insights from as wide a stakeholder base as possible if we want our recommendations to be useful to the widest possible audience.

Points of interest:

  • Lightning talks where we presented various DMP-related tools’ approaches to modelling this subject area. [Slides]
  • By Common Data Model we mean finding a way to let funders ask their questions and receive answers in a standard, well-defined way that does not restrict expressiveness and represents the very diverse approaches and requirements that different groups and organisations have.* [Theme 2]
  • DMPRoadmap data model’s emphasis on KISS principle (Keep It Simple & Straightforward): 
    Plans use Templates – Templates have Phases, which contain Sections with Questions in them.
  • Missing DMP data may be the result of a question that was never asked. [DCC‘s Kevin Ashley]
  • There may be a middle ground between free-text and machine-readable content in the concept of Themes.
  • More emphasis on integration between APIs, data sources and tools as opposed to simply exporting/sharing DMPs. 

Emerging themes:

  1. The first emerging theme was convergence & collaboration: collect as many user stories as possible for maximum inclusivity.
  2. By extension, the need to avoid overspecification & overengineering in the WG’s outputs, to prevent having to shoehorn diverse standards and methodologies into a narrow data/metadata model in the future.
    • This is what makes many recommendations that look good on paper unusable in the real world.

Exposing DMPs WG

Another full room at P11, this time for the Exposing DMPs WG meeting:

  • Lightning talk presentations on the subject of sharing DMP content by Angus Whyte (DCC), David Carr (Wellcome), Elena Zudilova-Seinstra (Elsevier RDM Solutions), Iain Hrynaszkiewicz (BMC Research Notes), Sandra Gesing (Open Science Framework), Stephanie Simms (CDL).
  • Use cases for exposing DMPs
  • Real-time voting: Which use cases should we spend our time on? – showed great interest on the subjects of Integration and Evaluation.

Favourite moment

Resolutely debugging the RDA Metadata Standards Catalog with Alex Ball during a coffee break – having coffee standing up Italian style, furiously typing into laptops & tablets, we must have been a sight!

RDA Plenaries are an excellent opportunity for learning and collaboration, as there’s so much experience around in many different subject areas – so many user stories, and so many different perspectives to stimulate conversation and one’s interest in previously unexplored topics.

 

Full room at DMP Common Standards WG… …and at Exposing DMPs WG!

Photos: Tomasz Miksa, Jimmy Angelakos

Pizza Party!

The DMPTool team has embarked on a major housekeeping effort in order to migrate to the DMPRoadmap platform in February 2018. Last week they began a global audit of the funder templates and guidance in an all-day pizza-fueled event that amounted to a h…

Roll up, roll up! Get yer DMP update here.

Image Paper seller and bench CC-BY-NC-ND By Henry…
Last month saw a busy Active DMPs and Domain Repositories Interest Groups joint session at the RDA Plenary at Montreal. Two new working groups have been launched to advance work in this area: one on…

DMPRoadmap summer camp news

Image credit: Airstream CC-BY-NC by dwstucke
 
This summer we’ve made solid progress toward our DMPRoadmap MVP, done oodles of outreach for machine-actionable DMPs, and addressed some DMPTool and DMPonline-specific items. Keep reading for t…

On the right track(s) – DCC release draws nigh

Eurostar by red hand records CC-BY-ND
Preliminary DMPRoadmap out to test
We’ve made a major breakthrough this month, getting a preliminary version of the DMPRoadmap code out to test on DMPonline, DMPTuuli and DMPMelbourne. This has taken longer …

RDA-DMP movings and shakings

An update on RDA and our Active DMP work, courtesy of Stephanie Simms

RDA Plenary 9 
We had another productive gathering of #ActiveDMPs enthusiasts at the Research Data Alliance (RDA) plenary meeting in Barcelona (5-7 Apr). Just prior to the meet…

Roadmap retrospective: 2016

 
Here’s an update on DMPRoadmap, courtesy of Stephanie Simms at CDL
 
2016 in review
 
The past year has been a wild ride, in more ways than one… Despite our respective political climates, UC3 and DCC remain enthusiastic about our partnership and the future of DMPs. Below is a brief retrospective about where we’ve been in 2016 and a roadmap (if you will…we also wish we’d chosen a different name for our joint project) for where we’re going in 2017. Jump to the end if you just want to know how to get involved with DMP events at the International Digital Curation Conference (IDCC 2017, 20–23 Feb in Edinburgh, register here).
 
In 2016 we consolidated our UC3-DCC project team, our plans for the merged platform (see the roadmap to MVP), and began testing a co-development process that will provide a framework for community contributions down the line. We’re plowing through the list of features and adding documentation to the GitHub repo—all are invited to join us at IDCC 2017 for presentations and demos of our progress to date (papers, slides, etc. will all be posted after the event). For those not attending IDCC, please let us know if you have ideas, questions, anything at all to contribute ahead of the event!
 
DMPs sans frontières 
 
Now we’d like to take a minute and reflect on events of the past year, particularly in the realm of open data policies, and the implications for DMPs and data management writ large. The open scholarship revolution has progressed to a point where top-level policies mandate open access to the results of government-funded research, including research data, in the US, UK, and EU, with similar principles and policies gaining momentum in Australia, Canada, South Africa, and elsewhere. DMPs are the primary vehicle for complying with these policies, and because research is a global enterprise, awareness of DMPs has spread throughout the research community. Another encouraging development is the ubiquity of the term FAIR data (Findable, Accessible, Interoperable, Reusable), which suggests that we’re all in agreement about what we’re trying to achieve.
 
On top of the accumulation of national data policies, 2016 ushered in a series of related developments in openness that contribute to the DMP conversation. To name a few:
 
  • More publishers articulated clear data policies, e.g., Springer Nature Research Data Policies apply to over 600 journals.
  • PLOS now requires an ORCID for all corresponding authors at the time of manuscript submission to promote discoverability and credit.
  • The Gates Foundation reinforced support for open access and open data by preventing funded researchers from publishing in journals that do not comply with its policy, which came into force at the beginning of 2017; this includes non-compliant high-impact journals such as Science, Nature, PNAS, and NEJM.
  • Researchers throughout the world continued to circumvent subscription access to scholarly literature by using Sci-Hub (Bohannon, 2016).
  • Library consortia in Germany and Taiwan canceled (or threatened to cancel) subscriptions to Elsevier journals because of open-access related conflicts, and Peru canceled over a lack of government funding for expensive paid access (Schiermeier and Rodríguez Mega, 2017).
  • Reproducibility continued to gain prominence, e.g., the US National Institutes of Health (NIH) Policy on Rigor and Reproducibility came into force for most NIH and AHRQ grant proposals received in 2016.
  • The Software Citation Principles (Smith et al., 2016) recognized software as an important product of modern research that needs to be managed alongside data and other outputs.
This flurry of open scholarship activity, both top-down and bottom-up, across all stakeholders continues to drive adoption of our services. DMPonline and the DMPTool were developed in 2011 to support open data policies in the UK and US, respectively, but today our organizations engage with users throughout the world. An upsurge in international users is evident from email addresses for new accounts and web analytics. In addition, local installations of our open source tools, as both national and institutional services, continue to multiply (see a complete list here). 
 
Over the past year, the DMP community has validated our decision to consolidate our efforts by merging our technical platforms and coordinating outreach activities. The DMPRoadmap project feeds into a larger goal of harnessing the work of international DMP projects to benefit the entire community. We’re also engaged with some vibrant international working groups (e.g., Research Data Alliance Active DMPs, FORCE11 FAIR DMPs, Data Documentation Initiative DMP Metadata group) that have provided the opportunity to begin developing use cases for machine-actionable DMPs. So far the use cases encompass a controlled vocabulary for DMPs; integrations with other systems (e.g., Zenodo, Dataverse, Figshare, OSF, PURE, grant management systems, electronic lab notebooks); passing information to/from repositories; leveraging persistent identifiers (PIDs); and building APIs. 
 
2017 things to come
This brings us to outlining plans for 2017 and charting a course for DMPs of the future. DCC will be running the new Roadmap code soon. And once we’ve added everything from the development roadmap, the DMPTool will announce our plans for migration. At IDCC we’ll kick off the conversation about bringing the many local installations of our tools along for the ride to actualize the vision of a core, international DMP infrastructure. A Canadian and a French team are our gracious guinea pigs for testing the draft external contributor guidelines.
 
There will be plenty of opportunities to connect with us at IDCC. If you’re going to be at the main conference, we encourage you to attend our practice paper and/or join a DMP session we’ll be running in parallel with the BoFs on Wednesday afternoon, 22 Feb. The session will begin with a demo and update on DMPRoadmap; then we’ll break into two parallel tracks. One track will be for developers to learn more about recent data model changes and developer guidelines if they want to contribute to the code. The other track will be a buffet of DMP discussion groups. Given the overwhelming level of interest in the workshop (details below), one of these groups will cover machine-actionable DMPs. We’ll give a brief report on the workshop and invite others to feed into discussion. The other groups are likely to cover training/supporting DMPs, evaluation cribsheets for reviewing DMPs, or other topics per community requests. If there’s something you’d like to propose please let us know!
 
IDCC DMP utopia workshop
We’re also hosting a workshop on Monday, 20 Feb entitled “A postcard from the future: Tools and services from a perfect DMP world.” The focus will be on machine-actionable DMPs and how to integrate DMP tools into existing research workflows and services.  
 
The program includes presentations, activities, and discussion to address questions such as:
  • Where and how do DMPs fit in the overall research lifecycle (i.e., beyond grant proposals)?
  • Which data could be fed automatically from other systems into DMPs (or vice versa)?
  • What information can be validated automatically?
  • Which systems/services should connect with DMP tools?
  • What are the priorities for integrations?
We’ve gathered an international cohort of diverse players in the DMP game—repository managers, data librarians, funders, researchers, developers, etc.—to continue developing machine-actionable use cases and craft a vision for a DMP utopia of the future. We apologize again that we weren’t able to accommodate everyone who wanted to participate in the workshop, but rest assured that we plan to share all of the outputs and will likely convene similar events in the future. 
 
Keep a lookout for more detailed information about the workshop program in the coming weeks and feel free to continue providing input before, during, and afterward. This is absolutely a community-driven effort and we look forward to continuing our collaborations into the new year!

DMP themes: And then there were 14…

We issued a call for input on the DMP themes in late September and received feedback from across the UK, Europe and the USA. Many thanks to all who responded. It’s really helped to confirm our thinking.
 
We asked a few specific questions:

The DCC and its services

The DCC has undergone continuous change since it was established as a consortium in 2004 jointly funded by JISC and the UK E-Science programme. Periodically it’s important for us to clarify what these changes are and what implications they do – or don’t – have for the services you expect from us. The last six years have been marked by a deliberate diversification of the DCC’s income streams, an intensification of its international role and a corresponding reduction in its dependence on a core funding stream from Jisc which was under increasing pressure as that organisation went through a transition process following the Wilson review. Over the years, we’ve moved some activities to a cost recovery basis, increased project funding from other sources such as the European Commission and grown a healthy income stream from online services and consultancy. This trend continues as the core funding stream from Jisc came to an end in July 2016.

What does this mean for UK universities and researchers? In many ways, very little – in others, potentially a lot. Our existence is secure and our finances are healthy. We have a business plan for the next 5 years that anticipates growth and is grounded in reality. That’s been enough for our lead host, the University of Edinburgh, to give us the backing we need for continuity of services and staff. The University recognises the importance of having an impartial, national service and values the international recognition and prestige that hosting the DCC brings.

The main service we want to provide assurances on is DMPonline. We will continue to provide this as a UK national and international service and can guarantee ongoing support for a minimum of 2 years with a promise of at least 2 years notice should we need to make changes to that provision. We already have overseas customers for DMPonlne to whom we have made long-term commitments. The service is also a key component of a number of European e-infrastructure projects.  We’re engaging in discussions with key UK representative organisations to find the right business model for long-term UK provision and we welcome your views on that. The DCC has pioneered work in this area with funding support from BIS, Jisc, the European Commission and the University of Edinburgh’s Information Services Innovation Fund amongst other sources. We’re grateful to all of them for their support, past and present. We continue to commit significant resources, and in collaboration with our partners at the California Digital Library, are co-creating a single DMP platform that we’re confident is the world’s best. Current international initiatives on DMPs require collaboration and coordination of the key players and the DCC will continue to push that agenda through the Research Data Alliance and other appropriate bodies.  We are committed to ensuring that the rich expertise held by our staff remains accessible to the community. DMPonline will remain free to use to researchers. We continue to be able to provide support for institutional branding and customisation on a service contract or consultancy basis.

Our events such as RDMF and IDCC have been covering their costs now for some years, and we will continue to operate them as long as demand exists, and initiate new events where we see a requirement that is unlikely to be met by another agency. Identifying issues of common concern and providing fora to bring communities together to address them has always been part of our remit.  Our training has also been running on a cost-recovery basis for some time, and has expanded as a result. We continue to run courses that are open to all as well as in-house custom events for organisations in the UK and elsewhere.

We’ll also continue to produce publications and guidance of the quality you have come to expect from us alone or in collaboration with others, including The International Journal of Digital Curation (IJDC). We will also continue to engage with international bodies such as the Research Data Alliance, of which we are a founding organisational member. We’ll report back to you on what is happening and provide a channel for your concerns and ideas and/or foster participation by you and your colleagues.

Many of you will know that we continue to be involved in a number of European projects building and exploiting research infrastructures such as EUDAT, OpenAIRE and the European Open Science Cloud Pilot, and are doing an increasing amount of consultancy, which now provides over 20% of our income.  Our clients include universities, funders and a variety of international bodies.

We’ve spoken with many of our contacts about these changes but we realise that they may be a surprise to some. We’re sorry if that’s the case; be assured that our mission remains unchanged – to increase the capability and capacity of organisations worldwide to engage in data curation which fosters data use and reuse.  If you would like to know more about any aspect of our work, explore a collaboration or offer input on anything covered by this post, contact the DCC information desk at info@dcc.ac.uk or, if you prefer, contact me directly – director@dcc.ac.uk.

Finding our Roadmap rhythm

In keeping with our monthly updates about the merged Roadmap platform, here’s the short and the long of what we’ve been up to lately courtesy of Stephanie Simms of the DMPTool:

Short update

Long(er) update

This month our main focus has been on getting into a steady 2-week sprint groove that you can track on our GitHub Projects board. DCC/DMPonline is keen to migrate to the new codebase so in preparation we’re revising the database schema and optimizing the code. This clean-up work not only makes things easier for our core development team, but will facilitate community development efforts down the line. It also addresses some scalability issues that we encountered during a week of heavy use on the hosted instance of the Finnish DMPTuuli (thanks for the lessons learned, Finland!). We’ve also been evaluating dependencies and fixing all the bugs introduced by the recent Rails and Bootstrap migrations.

Once things are in good working order, DMPonline will complete their migration and we’ll shift focus to adding new features from the MVP roadmap. DMPTool won’t migrate to the new system until we’ve added everything on the list and conducted testing with our institutional partners from the steering committee. The CDL UX team is also helping us redesign some things, with particular attention to internationalization and improving accessibility for users with disabilities.

The rest of our activities revolve around gathering requirements and refining some use cases for machine-actionable DMPs. This runs the gamut from big-picture brainstorming to targeted work on features that we’ll implement in the new platform. The first step to achieving the latter involves a collaboration with Substance.io to implement a new text editor (Substance Forms). The new editor offers increased functionality, a framework for future work on machine-actionability, and delivers a better user experience throughout the platform. In addition, we’re refining the DMPonline themes (details here)—we’re still collecting feedback and are grateful to all those who have weighed in so far. Sarah and I will consolidate community input and share the new set of themes during the first meeting of a DDI working group to create a DMP vocabulary. We plan to coordinate our work on the themes with this parallel effort—more details as things get moving on that front in Nov.

Future brainstorming events include PIDapalooza—come to Iceland and share your ideas about persistent identifiers in DMPs!—and the International Digital Curation Conference (IDCC) 2017 for which registration is now open. We’ll present a Roadmap update at IDCC along with a demo of the new system. In addition, we’re hosting an interactive workshop for developers et al. to help us envision (and plan for) a perfect DMP world with tools and services that support FAIR, machine-actionable DMPs (more details forthcoming).

Two final, related bits of info: 1) we’re still seeking funding to speed up progress toward building machine-actionable DMP infrastructure; we weren’t successful with our Open Science Prize application but are hoping for better news on an IMLS preliminary proposal (both available here). 2) We’re also continuing to promote greater openness with DMPs; one approach involves expanding the RIO Journal Collection of exemplary plans. Check out the latest plan from Ethan White that also lives on GitHub and send us your thoughts on DMP workflows, publishing and sharing DMPs.