Blog

Research Data Management Highlights: Digital Infrastructure Summary Summit 2014 Summary Document

This week, a very significant document regarding the future of research data management and digital stewardship landed on my desktop. This is a PDF all academic librarians in Canada must read – whether or not you are tasked with RDM. If you are in IT, Research Facilitation, REB, Industry Compliance, or are a researcher or an administrator, then you should read this, too. It conveys the pressing importance of RDM to the profession, and it shows that we have an opportunity at hand if we take it – or a storm brewing if we turn it away.

The document is the Summary Report for the Digital Infrastructure Summit 2014This conference was hosted by the Leadership Council for Digital Infrastructure in January 2014. Group representation included CARL, CRKN, CANARIE, TC3+, and CUCCIO; in all 140 participants took part (p. 1). This document outlines the outcomes of the summit, which argued that RDM is lacking in Canada, that a sincere commitment to digital stewardship and not just technology is required to move forward, and the time to act is now (p. 1). If you are a Canadian academic librarian, download the document and read it now.

Note: I was not a participant of this summit and am only summarizing the PDF in regards to RDM in Canada for librarians. I’m standing on the shoulders of giants when I write this post.

This document asks What is Digital Infrastructure (DI)?, considers the existing problems that are hampering the development of an effective DI in Canada, and traces a clear path forward on which the Canadian research enterprise should move. Research Data Management and the people involved in it are front and centre in this document, and this means academic librarians and preservationists. The library has a significant role to play, and we are expected to contribute.

Digital Infrastructure and Soft Skills

One of the document’s biggest takeaways – and what I argue should be one of the first talking points you should use when discussing research data management – is that digital infrastructure (DI) is far more than technology alone. The executive summary states in clear, plain language that digital infrastructure includes “our ability to capture, manage, preserve, and use data . . . data are infrastructure, as are the highly skilled personnel who facilitate access to data, computational power and networks” (p. 1). DI requires “skilled knowledge management personnel” (p.1) who have technical capacity, but as we see elsewhere in the document, also can participate in local and national policy formulation and interpretation, understand project management, and have the capacity to collaborate and lead in their own field and in others. These are a suite of advanced “soft skills” that are concomitant IT knowledge and experience, and they are bound together with other essential criteria such as sustained funding and ongoing government and industry support, which allow research data management to flourish rather than wither on the vine. A successful solution that addresses near-team and long-team RDM issues requires skilled, committed resources on the ground who are leading the way. DI cannot be left to colleagues on limited term appointments or to our grad students. It demands institutional memory and it requires organizational vision.

I’ve mentioned the argument in the above paragraph in a post long ago, but I’ll take this opportunity to link out again. Chuck Humphrey states this in clear terms when he explains that RDM is the “what” and the “how”, and digital stewardship is the “who”, and both are necessary requirements in RDM infrastructure. If you are a librarian, then read Chuck’s website. If you are a Canadian librarian, then read it again. And again.

What’s wrong with Canada’s Digital Infrastructure?  

The Leadership Council has cut right to the chase in their document. They want you, the reader, to know right away that there are real issues affecting digital scholarship in Canada:

  1. Our research data are a national asset, and they are not stewarded properly (p. 1). Canada needs to get up to speed, quickly. It needs RDM and it needs it now. It requires data storage infrastructure it doesn’t have at present. It requires better skills training. It requires better software development. (Fellow Librarians: This is all about us.)
  2. There is very little governance and coordination (p. 1-2). There are many, many players, from funding agencies to libraries to standards organizations to researchers themselves. We are all trying hard to fix this, but we’re not working together. Our governance model is weak. Time is lost, efforts are duplicated, and we are spinning our wheels. (Fellow Librarians: This is very much about us. Get in there and make it happen.)
  3. There is very little federal policy regarding DI (p. 2). This is related but distinct from the second point. With little direction from government, the community is looking in all directions all at once. Greater coordination, planning, and sustained, reliable investment would be beneficial to the national research enterprise. (Fellow Librarians. This, too, is about us. Do. Take part. Take charge.)

I support it's not a blog post if you don't add a worldle.

How to act. How to improve RDM. How to solve this crisis.

Note: I am focusing on mainly on RDM and digital stewardship in this post; the original document gives equal attention to other areas such as governance, policy, and funding.

Research data management/stewardship is as yet the weakest link in the Canadian DI landscape, despite the massive increases in the amount of data being created daily through the research process. There is currently no agreed-upon strategy and/or the capacity to protect this valuable public asset, with little capacity to support access, use and reuse by a wide range of users. (p. 6)

The document makes a strong case not just for increased technical infrastructure but for greater knowledge management, project management, and policy analysis. We simply cannot allow ourselves to dump data files one after another onto a server and then hope that serendipity or an as-of-yet uncoded search algorithm will help us organize, preserve, and provide access to these files in the future. Research data – especially publicly funded research data – are a public good, and they require maintenance, management, and care.

The document highlights significant RDM gaps in Canada that must be addressed. These are:

  • Lack of a core RDM resource (p. 8)
      • Canada requires a national data service, which can lead in stewardship, policy, and education. RDC, CARL, and CRKN all have assets to contribute in this regard; RDC has shown incredible strength in this area already
  • Lack of strategy (p. 9)
    • Canada has no high-level strategy framework guiding debate and decisions on standards, infrastructure and distribution access networks, obligations to existing international agreements; funding
  • Lack of Policy leadership (p. 9)
    • Tri-Council should take the next step and implement RDM policy under consideration.
  • Weak RDM culture (p. 9-10)
    • The benefits that RDM brings must be better articulated.
  • Lack of understanding of Digital Infrastructure (p. 10)
    • It is incumbent that stakeholders demonstrate to the greater community that digital infrastructure necessarily includes the data, and the professionals who steward them
  • Lack of training (p. 10)
    • RDM training is inconsistent at present and must be improved in the short-term for practitioners and researchers alike
  • Weak policy on long-term data lifecycle management (p. 11)
    • Like any collection, data must be managed in part because its supports are not without cost. Management will include asking tough questions like what should be preserved, if we have the means and capacity to preserve it, and for what length of time. I recommend that we all have discussions about data collection policies as soon as possible. Locally, in our consortia, and nationally.
  • Lack of Storage (p. 11)
    • Storage capacity for all disciplines must be addressed. RDM is in no way an “X not Y” proposition. We must serve all discipline, departments, faculties, and researchers.
  • Means to foster acceptance (p. 11)
    • This is a tricky issue. We need our researchers to accept and be a part of RDM. Compliance should be required, but strict policies at the outset may prevent too much pushback. There will be give-and-take in the beginning.
    • Note: The original document refers to “compliance” here. I don’t want to use that term. Do we need sticks? Yes. Do we want to use them? Only if we have to. But from the outside, we must have the attitude that everyone is a partner in this venture.

Good data stewardship is not just a researcher’s responsibility, but it also needed at institutional, organizational, national, and disciplinary levels. (p. 10)

Making things happen and getting things done.

The LC provides a roadmap for action and results in its summary report from its 2014 Digital Infrastructure Summit. I am focusing on RDM-related activities and policy in this post since they are both so important to me, so I do encourage you to read the entire document yourselves to see the entire action plan.

The LC’s ways forward for RDM and policy include:

  • Maintain the Leadership Council and analyze its organizational structure (p. 17-18)
    • A steering committee is required and the LC has done a good job this far. That said, there are clamours and a need for greater representation. Consider increasing membership, developing an executive committee, form working groups, and establishing a Charter and Secretariat
  • Engage government (p. 19)
    • The LC had developed a strong community-driven response to RDM challenges. That said, push government – again – for improved coordination of policy and funding
  • Establish a national RDM network (p. 20)
    • Working on CARL and CRKN’s leadership and experience in this area, establish a network focusing on services, tools and tech, and education
  • Create an RDM pilot (p. 21)
    • Develop pilot discipline-based RDM programmes in three domains: astronomy, social sciences, and medical genomics
  • Coordinate with CRKN’s Integrated Digital Scholarship Ecosystem (ISDE) (p. 22)
    • Engage with this initiative that will enable next-gen library collaboration for seamless access, and improved infrastructure
  • Develop an RDM metrics pilot (p. 22-23)
    • For assessment, understanding performance

If you have made it this far in the post, then I offer you my congratulations. There is a lot of information to synthesize, but it is vital that academic librarians in Canada understand what is on the horizon for our profession, and what role will be expected of us. As this post shows, the work that follows – the opportunity we can take hold of – is as much resource-related and people-related as it is tech-related. To discuss digital infrastructure is to discuss the people who make it happen. Research Data Management doesn’t happen on its own. RDM requires careful planning, policy interpretation, technical capacity, and a thorough understanding of resource management.  

And yes, this is an opportunity for us. But we must be ready for what is to come. RDM will soon become the coordinated response to big data in Canada as it is elsewhere in the developed world, and it will mean work. But this is our work. It is our field. Take heed, take note, ask questions, and get set. Plan for this, and get set to play a leading role, because things are going to get busy.

tl;dr : read this now.  apply it to your work.

 

Required Reading, 9 January 2014

cropped-3554628032_97be3b3319_b.jpg

Required Reading:

  • Joyce, R. (ed.) (2013) : Research Data Management: Practical Strategies for Information Professionals
    • “This volume provides a framework to guide information professionals in academic libraries, presses, and data centers through the process of managing research data from the planning stages through the life of a grant project and beyond. It illustrates principles of good practice with use-case examples and illuminates promising data service models through case studies of innovative, successful projects and collaborations.”
  • Vines, T.H., et al. (2013) : The Availability of Research Data Declines Rapidly with Article Age
    • This link truly is required reading.  Vines et al. conduct a statistical analysis that shows the persistent decline in the availability of and access to research data as well as the lowered chances of finding a working PI e-mail address) over time in scholarly literature. This is the proof you can give to doubting Thomases about the need for proper research data management and digital preservation principles.
    • “Our results reinforce the notion that, in the long term, research data cannot be reliably preserved by individual researchers, and further demonstrate the urgent need for policies mandating data sharing via public archives.”
    • [Mendeley link]

Required Reading, 8 January 2014

cropped-3554628032_97be3b3319_b.jpg

Required Reading:

  • OCLC : Starting the Conversation: University-wide Research Data Management Policy
    • A call for action that summarizes the benefits of systemic data management planning and identifies the stakeholders and their concerns. It suggests that the library director proactively initiate a conversation among these stakeholders to get buy-in for a high-level, responsible data planning and management policy that is proactive, rather than reactive. It also addresses the various topics that should be discussed and provides a checklist of issues to help the discussion result in a supportable and sustainable policy.”
  • Chronicle : Born Digital, Projects Need Attention to Survive
    • “A team . . . often based in academic libraries or digital-scholarship centers-has to conduct regular inspections and make sure that today’s digital scholarship doesn’t become tomorrow’s digital junk.
      . . .
      Mr. Daigle advises scholars who want to pursue digital-humanities work to consult with their librarians and put long-term archiving strategies in place early on. ‘Think about the life cycle of preservation,” he says. “The more you do that, the longer it’s going to be around, and that is time well spent.'”
  • The Tyee : What’s Driving Chaotic Dismantling of Canada’s Science Libraries?
    • On the ongoing dismantling of government research libraries in Canada
    • “I saw a private consultant firm working for Manitoba Hydro back up a truck and fill it with Manitoba data and materials that the public had paid for. I was profoundly saddened and appalled.”

Research can be hard. That’s okay.

Joe Hardenbrook has written a great post on the issue of making (re)search exciting or fun, which you should all read.  It drives home some opinions that I bet a lot of us share, mainly being that research isn’t necessarily exciting.  And that any “excitement potential” held in a research project is dependent more on the subject matter and the researcher than it is in any tool at hand.  Kudos to Joe for speaking plainly and truthfully on this subject.

This “make research fun” issue is something that I often struggle with, especially when reading some LIS blogs and Twitter streams that veer into the subject.  “How do I make research exciting?”  “How can I make this assignment fun?”  “How can I keep them interested enough to see all of ProQuest’s search refinements?”  “Will they pay more attention in my one-shot if I launch into a stand-up comedy routine?”  I don’t buy into the argument underlying questions like those.  I’ll speak candidly here: I do not believe that it’s our job to make research “fun.”

I’m not a killjoy at work, and I promise that Steeleworthy’s Office Hours is not all doom and gloom, but neither am I some one who is going to pretend that research is something it’s not. Research is work.  Research can be frustrating.  Even for the best researchers on the planet, research can be difficult.  The advice and consultation I give can definitely help researchers of all kinds (students, faculty, staff, community members, etc) make their research more efficient by systematizing their work and improving their search precision, and this will make their work easier had they not spoken to me first. If our researcher’s excitement or interest is piqued in the research process – if they found cheap thrills as librarians and archivists do when they’ve found the needle in the haystack – then I’m thrilled and I know I’ve done my job. But “excitement” should not be our primary measure here.  Our main purpose (our main duty) is to collect the best of the world’s information, master its systems, and then help others access this information and master the systems themselves.

What it comes down to for me is how we present ourselves and our work to our users.  Our users don’t expect us to be fun-makers.  Our users come to us because they need help finding stuff.  If we can put a smile on their face along the way, then we’re doing them a service, but only if we’ve helped them find that information in the first place.

I’m going to end this by quoting Joe:

I’m a realist: Will they be excited? Chances are, no. But will they think the research process seems a little more doable and will they be willing to seek help? Yes.

We’re here to help people master research. In many ways, the library is where the job gets done at university. I really do hope that everyone enjoys their work, whatever their work is. And I also hope that everyone enjoys the classes they are enrolled in and the assignments they are working on. But those assignments are still work, and I don’t want to play down the effort required to get the job done.  I have an excellent demeanour with my users and I’m confident that many of them see me as a team player who will help them through their research. But I won’t lie to them. I’m not going to say that some one is going to have a blast writing an essay when really they just want to have watched last night’s season premiere of Revenge.  Instead, I speak honestly to my users. “Is this going to be work?” Yes, probably.  “Is it going to take some time?” Yes, likely.  “Is it going to be the worst thing to ever happen in my life?” Not at all.  “Are there tools and techniques I can use to make my work more efficient?” Yes, definitely.  “Can you help me?” Yes, obviously. I am already.

September Projects, 2013

I’m stating the obvious by telling you that September is a busy month in academics.  The start of the school calendar changes the mood, tempo, and pulse of a university campus, and it shifts things at the library into full gear as we roll out all programming and services. Here are a few of the things I’ve been contributing to lately, which has kept me busy in a good way (as opposed to the bad kind of busy).

Changing liaison duties

I’ve taken on liaison duties in Sociology and Social Work while a colleague is on sabbatical this year, and I’m also part of a group that is expanding the Library’s services to the University’s students who are cross-registered at the Balsillie School of International Affairs.  This has already translated to a large increase in my in-class instruction, especially for Social Work, which is its own Faculty and has its own small library. Taking on these subject-based duties has been a great opportunity since they’ve given me greater everyday contact with faculty and researchers whose work touches on socio-economic data or would benefit from research data management support and consultation.  Simply put, it’s a lot easier to push data management planning when you already have a built-in relationship with the researcher, so I expect my subject-based work in Sociology and Social Work to benefit our RDM programme.

Outreach to Faculty and Students

This term, I’m offering a full slate of seminars on research data management, bibliometrics, and data access through the library. I developed these seminars with a graduate student/faculty audience in mind, partly to help the Library increase its presence within graduate programming and in the university’s research enterprise. While I don’t expect large numbers because this is the first time in a few terms that we developed a suite of seminars with graduate or faculty research in mind,  I do hope they begin to build on our growing profile as a center of research facilitation on campus.  (This could be a blog post in its own right; I may have to write more on it in the future.)

Citation management

Our Library has taken a close look at RefWorks and has also considered what kinds of citation management systems our users use.  What we’ve known all along is that many people use RefWorks and many people do not.  We asked ourselves why we commit our support only to one service when our users will always work with their personal preferences in mind, and we decided that giving information and advice on RefWorks alone just doesn’t cut it.  If we are to support or know something about citation management and research management, then we shouldn’t limit ourselves to only one tool.  Going forward, our library is now supporting RefWorks, Zotero, and Mendeley by offering instructional sessions, consultation, and in some cases, even research collaboration between the researcher and the librarian through these tools

Data and Statistics

Don’t think that this project or the next (RDM) are subordinated to the others I’ve mentioned because they’re at the bottom of this list.  That’s far from the truth as both areas have seen significant change in the past few months.  On the Data and Stats side alone, data librarians in Canada are busy dealing with a new EULA for Canada Post postal code products (e.g., the PCCF) and what it means for researcher access and use. This issue alone has eaten up probably half of my time in the past two weeks since the new EULA changes long-standing practices for researchers who use these products, and the library, which administers licences on their behalf. A lot of time has been given to consultation within the data community and within the library to produce new practices, and I’m now rolling out a PR and education campaign.  If you are Laurier faculty and use postal code products, be on the look out for more news on this very shortly.  If you’re faculty at another university in Canada, you may want to contact your own data librarian.

Research data management

RDM has been the largest part of my work at Laurier.  We’ve been developing a research data management programme, based largely on consultative support that helps researchers and research groups learn about and then develop flexible data management plans that speak to their current and future research needs.  Next month, I’ll be attending CASRAI’s ReConnect2013 conference to build upon my current knowledge and to see and learn what others are doing in this area across Canada and in other jurisdictions.  We have a couple of RDM projects on the go already, and I would like to increase that number on campus since the service the Library provides offers clear benefits to the researchers in terms of meeting funding obligations, providing research management planning, and improving access to and citation of produced work after the research has ended.  Laurier researchers: let’s chat.

The Beastie Boys Library Sabotage Mashup

While I generally don’t get too geared up for zany library funtimes (because I want people to take me seriously since I do serious work), this video is definitely worth watching.  It mashes up one of the greatest bands of the late 20th century with a little LIS:

Happy Monday!  Now everyone go listen to favourite Beastie Boys album on rdio

 

Work with me: Limited Term Business Librarian Opening

The Laurier Library is searching for great candidates for another limited term posting.  We have a 1-year, limited term part-time appointment for a Business and Economics Librarian at our Waterloo campus.  Working with our other Business and Economics Librarian, the person who takes on this position will hit the ground running and be quite busy interacting with the School of Business and Economics – the largest faculty at Wilfrid Laurier University.

I promise that this would be an incredible learning opportunity for new or recent grads, and also a chance to gain and build upon your existing experience with business and economics librarianship. 

The Laurier Library : We Mean Business
The Laurier Library : We Mean Business

Business and Economics students at Laurier work with stats and data quite a bit, so I expect to collaborate often with who ever fills this position.  I’m happy to answer any general questions you might have about the posting, the university, and living in Waterloo, Ontario.

See our Business Librarian posting here.  The application deadline is August 5, so start moving on this one soon.

-Michael.

Recently published : Incorporating Online Instruction in Academic Libraries

Here’s my opportunity to influence all of your summer reading lists. The article that I co–authored with my colleague, Pauline Dewan, Incorporating Online Instruction in Academic Libraries: Getting Ahead of the Curve, has just been published in the Journal of Library & Information Services in Distance Learning.

These children are using a computer to help them solve their homework problems

This (with other reports, presentations, and articles), is the culmination of a large project Pauline and I undertook to assess our Library’s online teaching and learning programme in 2012. During the course of that project, we conducted a thorough literature review and analysis of the state of online instruction in our library, and generally in North America. The recommendations we made regarding goal formation, acquiring stakeholder buy-in, technological formats and delivery, and organizational change, can speak well to many libraries’ online teaching and learning programmes.

We also have an extensive bibliography that touches on policy, analysis, information literacy, and organizational behaviour – check this out because we’ve done a lot of heavy lifting, which may you help you in your own work.

Happy reading,

-m

Work with me: Limited Term Part-time Science Librarian opening

The Laurier Library is seeking out great candidates for our open limited-term, part-time Science Librarian posting. Click here for details.  I’m certain that the person who takes on this file will get a lot of classroom/instruction time and will see many research consultations – you’ll be busy, but it will be a Good Busy. 

The Library is accepting applications until July 22, 2013, so act fast!

 

(One more time: Click here for details.)

Laurier Librarians Getting the Job Done.
Laurier Librarians Getting the Job Done.

One does not simply walk into an RDC

It’s that time of year when more and more students are asking about accessing datasets for their research through our local Research Data Centre. And a couple times now, I’ve found myself having to explain that one does not simply walk into an RDC

One Does Not Simply Walk Into an RDC
One Does Not Simply Walk Into an RDC