Unsub – All Publishers Supported

Unsub is a dashboard that helps you reevaluate your big deal’s value and understand your cancellation options.

For the last few years we’ve supported a small set of very large publishers.

One of the most requested features has been support for more publishers.

As of today – right now – we support all publishers.

We heard you, and we’re super excited to get this in your hands. Here’s some important details:

  • All publishers are supported. We no longer support specific publishers, but rather we support any publisher.
  • A mix of publishers is supported. This was another oft requested feature, mostly related to aggregators, and actually naturally arose out of our change to support all publishers. Unsub dashboards no longer have logic filtering what titles are in your dashboard by publisher – so it’s just as easy for a dashboard to have titles from one publisher or 20 publishers.
  • Title prices are now required. Supporting all publishers, it’s not feasible for us to collect and update titles prices for all of their titles. For existing Unsub packages created before today, we’ve incorporated the public prices we had (for the big 5 we supported: Elsevier, Springer Nature, Wiley, Taylor & Francis, SAGE) into your packages. For new packages moving forward, you’ll have to upload your own title prices. We’ve updated the documentation accordingly.
  • APC report has moved from package to institution level. We have APC data for the big 5 publishers, but now that we’re moving to any publisher, we can no longer provide publisher specific APC reports. However, you can now get an APC report for your institution that includes an estimate of your APC spend for the big 5 publishers (Elsevier, Springer Nature, Wiley, Taylor & Francis, SAGE). See the APC Report documentation page for more.

But, we didn’t stop there. Here’s some additional features you can use today that we think you’ll enjoy:

  • Packages now have Descriptions. When you login to Unsub you’ll see evidence of this change straight away. You can use this package attribute to include a lot of detail about your package to remind your future self and others of important details about your package. See the docs for more information.
  • Package views now have an Edit Details tab. In this tab you can change the package name and description. See the docs for more information.
  • Packages have an optional filter setup step. This could be used for a variety of use cases, but first and foremost can be used to get back to the state of your package before today’s changes. That is, we no longer filter by publisher. If you had a Wiley package before today you should have only seen titles published by Wiley in your dashboard. However, moving forward, we do not filter by publisher, so that same Wiley package may include some titles from other publishers that were in your COUNTER reports. You can use this new feature to limit the set of titles that appear in your dashboard. See the Upload journal filter documentation page to learn more.

Notes:

  • During testing, we heard that aggregators may not provide a COUNTER 5 TR_J2 file. As we require a TR_J2 file if you choose COUNTER 5 in Unsub, we provide a fake TR_J2 file. Let us know if you run into any issues with this! See the docs page for more info.
  • As we support more publishers, we’ll run into more edge cases. We’ve heard that some publishers only provide a COUNTER 5 TR_J1 file – and do not provide TR_J2, TR_J3, and TR_J4 files. We don’t currently support the COUNTER 5 TR_J1 file. Get in touch if this is something you need.
  • There may be “growing pains” moving from support for 5 publishers to all publishers. For example, journal metadata that’s crucial to Unsub may not be complete for some journals. Please do get in touch if you run into any issues. We’ll be keeping an eye on things and will

If you are not a current Unsub subscriber and you’re interested to learn more schedule a demo or go ahead and purchase.

If you are a current Unsub subscriber, log in, kick the tires, and let us know what you think.

To learn more about all the new features head over to our documentation.

In an upcoming webinar (date to be announced soon) I’ll dive into all the new features and answer any questions.

OurResearch news: Heather stepping down

Hi everybody, this is Heather. I wanted to let you know I’m stepping down from OurResearch, effective mid-June 2022.

I’m so proud of what we’ve built over the last 10 years. I firmly believe the team will keep doing great things to advance open infrastructure in scholarly communications. My departure is on the most amicable of terms, and I will remain on the Board of Directors and OurResearch’s biggest fan.

Why leave? I’m ready for a change. This move has been in the works for some time. To start with I’ll take a few months off to rest and spend with my family (and cycle, read, and eat cookies) and then I’m not sure! 

Will keep this short and sweet because otherwise I’ll probably cry — building these ideas and tools with Jason has always been a labour of love. Wishing everyone the best. 

Rooting for the openiest of science ASAP,

Heather


Hey, this is Jason. This post is tough to write because I’d really like to say something profound and moving, something that expresses how much the last eleven years working with Heather have meant to me. Something that expresses how much I admire, respect, and love her. Something that conveys how OurResearch will always be incomplete without her–but how, at the same time, I’m 100% sure that we’ll continue to grow and prosper, thanks to the work she’s put in.

Now, I know that y’all know Heather is amazing. You know she’s smart and tough and kind and pragmatic and idealistic and authentic and clever and relentless and funny. You know that she’s put her heart and soul and love and self into Open Science and into OurResearch, and you know that she’s got a bigger heart and soul and love and self than just about anyone.

But y’all don’t know it like I know it.  I’ve seen it, up close, for eleven years. I’ve seen her on sleepless nights, when we had no money, when people were being mean, when servers were down, in the darkest and toughest of times. And I’ve never stopped being inspired by her. I’ve seen her perform code miracles and budget miracles and admin miracles and everything in between. And more than that: I’ve seen her do it with unflagging kindness, humility, and integrity. I’ve seen her as few have.

And I’m forever, deeply grateful for that: that I got to see her in action, be on her team, experience all the crazy highs and lows and sidewayses of cofounderdom with her. It’s been a profound honor.

So even more than I’ll miss Heather, I’m grateful for Heather. And I’ll be trying very hard to live up to her example, to practice all I’ve learned from her. Which means I’ll be working my guts out for OurResearch, because I believe in it with all my heart. We’ve got a great product in OpenAlex, a great team, a great board (including Heather still, huzzah!) and we’re going to be doing great things. I know that’s what Heather wants, and it’s what I want, and by golly we’ll do it. 

I’ll miss you, Heath. Thanks for a great decade. We won’t let you down.

j

New OpenAlex API features!

We’ve got a ton of great API improvements to report! If you’re an API user, there’s a good chance there’s something in here you’re gonna love.

Search

You can now search both titles and abstracts. We’ve also implemented stemming, so a search for “frogs” now automatically gets your results mentioning “frog,” too. Thanks to these changes, searches for works now deliver around 10x more results. This can all be accessed using the new search query parameter.

New entity filters

We’ve added support for tons of new filters, which are documented here. You can now:

  • get all of a work’s outgoing citations (ie, its references section) with a single query. 
  • search within each work’s raw affiliation data to find an arbitrary string (eg a specific department within an organization)
  • filter on whether or not an entity has a canonical external ID (works: has_doi, authors: has_orcid, etc)

Request multiple records by ID at once

This has been our most-requested feature and we’re super excited to roll it out! By using the new OR operator, you can request up to 50 entities in a single API call. You can use any ID we support–DOI, ISSN, OpenAlex ID, etc.

Deep paging

Using cursor-based paging, you can now retrieve an infinite number of results (it used to be just the top 10,000). But remember: if you want to download the entire dataset, please use the snapshot, not the API! The snapshot is the exact same data in the exact same format, but much much faster and cheaper for you and us.

More groups in group_by queries

We now return the top 200 groups (it used to be just the top 50).

New Autocomplete endpoint

Our new autocomplete endpoint dead easy to use our data to power an autocomplete/typeahead widget in your own projects. It works for any of our five entity types (works, authors, venues, institutions, or concepts). If you’ve got users inputting the names of journals, institutions, or other entities, now you can easily let them choose an entity instead of entering free text–and then you can store the ID (ISSN, ROR, whatever) instead of passing strings around everywhere. 

Better docs

In addition to documenting the new features above, we’ve also added lots of new documentation for existing features, addressing our most frequent questions and requests:

Thanks to everyone who’s been in touch to ask for new features, report bugs, and tell us where we can improve (also where we’re doing well, we’re ok with that too).
We’ll continue improving the API and the docs. We’re also putting tons of work into improving the underlying dataset’s accuracy and coverage, and we’re happy to report that we’ve improved a lot on what we inherited from MAG, with more improvements to come. We’ve delayed the launch of the full web UI, but expect that in the summer…we are so excited about all the possibilities that’s going to open up.

Unsub Webinar Series

We’re starting an Unsub (https://unsub.org/) webinar series next week!

Why would you want to attend? These webinars should help you get better value from Unsub regardless of whether you want to just understand your options, get a better deal on your big deal, or cancel your big deal. 

Every two weeks we’ll cover a new topic, with two time slots for each topic to serve a wider array of time zones: morning and afternoon PST (Pacific Standard) time.

If our webinar times don’t work for you, we are planning to record webinars and upload them for anyone to watch on Vimeo (https://vimeo.com/unsub).

Here are the first three topics we’ll cover:

  • Feb 8 & 10: Unsub demo – an overview of the product
  • Feb 22 & 24: Eric Schares demoing Unsub Extender
  • Mar 8 & 10: Deep dive on Unsub scenarios

Other topics are in the works – we’ll announce them soon. Let us know here, elsewhere, or email me (scott@ourresearch.org) if there’s any topics you’d like covered in our webinar series.

The webinar series is free. However, we will require registration so we know how many people are coming and to make it easier for you to remember to attend (i.e., Zoom email confirmation, add to your calendar, etc). 

Our first webinar is titled Unsub Demo – An Overview of the Product – Feb 8 and 10:

We’ll put out registration links soon for subsequent webinar topics.

OpenAlex Update: Jan 24 2022

The OpenAlex launch is going well!  Thanks for all of your feedback, comments, questions, and help spreading the word.  A few updates for you below.

Snapshot updates

There is a new native-format snapshot, with the following updates:

  • includes “abstract_inverted_index” in works
  • includes “raw_affiliation_string” in works.authorships (thanks for requesting this!)
  • includes “cited_by_api_url” in works is now a string not a list (sorry! the list was a bug)
  • corrected the spelling of institution.associated_institutions
  • “ids” dict doesn’t include entries for empty ids anymore (simplifies the data)

This new snapshot doesn’t have additional new works since the previous one, but we expect new works to be added in the next week, and approximately every 2 weeks after that.  A new MAG-format snapshot including new works will also be release at that time.  Each new snapshot will contain articles published up to just a few days before the snapshot release (rather than several weeks old as was the case with MAG). 

API updates

The same changes as described above for the snapshot, importantly including the “abstract_inverted_index” in the list and filter endpoints.

Nature write-up

The OpenAlex launch was covered in Nature this week!  You can read about it here:  https://doi.org/10.1038/d41586-022-00138-y  
We are really happy to hear that people are finding it easy to use!

OpenAlex Tips of the Day

We have been posting tips for using OpenAlex on Twitter every weekday.  

You can see past tips at this search link (whether you have a twitter account or not), and you can follow us on twitter here: @openalex_org

Questions?

We’d love to hear from you: team@ourresearch.org

OpenAlex launch!

OpenAlex launched this week! (January 3rd 2022 for those reading from the future 🙂 )

As expected:

We’re now pulling in new content on our own. Until now, we’ve been getting new works, authors, and other entities from MAG. Now that MAG is gone, we’re gathering all of our own data from the big wide internet.

The new REST API is launched! This is a much faster and easier way to access the OpenAlex database than downloading and installing the snapshot. It’s completely open and free–you don’t even need a user account or token.

We’ve now got oodles of new documentation here: https://docs.openalex.org/

Slight change of plan:

The MAG Format snapshot is now hosted for free, thanks to the AWS Open Data program. This will cover the data transfer fees (which turned out to be $70!) so you don’t have to. Here are the new instructions on how to download the MAG format snapshot to your machine.


We are extending the beta period for OpenAlex; we’ll emerge from beta in February. This is mostly in response to discovering issues with the coverage and structure of existing data sources including MAG. Extending the beta reflects the fact that the data will improve significantly between now and February.

Huge exciting news:

OpenAlex was built to offer a drop-in replacement for MAG. We’re doing that. But today, we’re also unveiling some moves toward a more innovative future for Openalex:

We’ve now built around a simple new five-entity model: works, authors, venues (journals and repositories), institutions, and concepts. Everything in OpenAlex is one of these entities, or a connection between them. Each type of entity has its own API endpoint.

We’ve got a new Standard Format for the snapshot, one that’s closely tied to both the five-entity model the API. In the future, this will become the only supported format. The MAG format is now deprecated and will go away on July 1, 2022.

In conclusion:

Thanks for your support, and please send us any feedback you find! In particular, let us know about bugs…it’s early days, and there will be plenty. We’re currently fixing these very quickly. Happy New Year, and happy OpenAlexing!

Best,
Jason and Heather

OpenAlex data now in beta

We’re thrilled to announce our beta release of OpenAlex! You can learn more and download the inaugural data dump on the website at https://openalex.org.

Keep in mind that we’re in early beta — we recommend against using OpenAlex in production contexts until our official release Jan 3.

This beta release is aimed mostly at helping existing MAG users get started with their migration efforts; users without MAG experience may find the documentation a bit light. We’ll be adding a lot of new documentation over the next few weeks.

If you find bugs or have feature requests, please let us know! team@ourresearch.org, we’d love to hear from you.

Joining OurResearch to work on Unsub

Why OurResearch?

I’m thrilled to have landed a job with OurResearch working full-time on Unsub. When I was looking for a job this summer I wanted a new experience; I wanted to be challenged and to learn new skills – Unsub was the perfect match. With respect to coding, I moved from 100% R programming to 100% Python. In addition, the domain (tools for librarians) is very different from my previous job (open source software for researchers) – just the big change I wanted. 

Academic Libraries

Despite coming into this job without experience working as a librarian, I’ve always deeply appreciated libraries and the work librarians do. During my time in academia (bachelors through post-doc) I benefited a lot from various university libraries (Rice University and Simon Fraser University, to name a few), and experienced the technological change from print to electronic as ILL requests first came in hardbound and printed form, then transitioned to electronic forms. I’m excited to be able to help librarians after benefiting from their work for so many years.

What I’ll work on

As the Unsub product owner I’ll make decisions about features, implement those features, fix bugs, do demos for librarians, and of course do lots of support. I’m excited to make Unsub the best tool for librarians to reevaluate big deals and understand their cancellation options. 

Challenges and opportunities

The biggest challenge I see in maintaining Unsub is making sure our forecasts are as accurate as possible. I’ve learned already that it can be difficult to keep track of what publishers are doing with respect to big deals, title by title prices, etc. 

There’s a big, neh huge, opportunity here to push scholarly literature much further towards open access – while at the same time freeing up library budgets to support more collaborative players in the scholarly publishing community.

MAG replacement update: meet OpenAlex!

Last month, we announced that we’re launching a replacement for Microsoft Academic Graph (MAG) this December–just before MAG itself will be discontinued.  We’ve heard from a lot of current MAG users since then. All of them have offered their support and encouragement (which we really appreciate), and all have also all been curious to learn more. So: here’s more! It’s a snapshot of what we know right now.  As the project progresses, we’ll have more details to share, keeping everyone as up-to-date as we can.

Name

We’ve now got a name for this project: OpenAlex. We like that it (a) emphasizes Open, and (b) is inspired by the ancient Library of Alexandria — like that fabled institution, OpenAlex will strive to create a comprehensive map of the global scholarly conversation. We’ll start with MAG data, and we’ll expand over time. Along with the name, we’ve got the beginnings of a webpage at openalex.org, and a Twitter account at @OpenAlex_org.

Mailing list

We’ve now got a mailing list where you can sign up for more announcements as they happen. You can sign up for the mailing list on the new OpenAlex homepage.

Funding

Our nonprofit OurResearch recently received a $4.5 million grant from the Arcadia Fund, a charitable fund of Lisbet Rausing and Peter Baldwin. This grant has been in the works for some time, and is a big part of why we felt confident in announcing OpenAlex when we did. In the proposal, there was already a plan for a project similar to OpenAlex, so we were able to quickly pivot the grant details to direct about a million dollars to the development of OpenAlex. 

It’s a three year grant, which will give us plenty of time to develop and launch OpenAlex, as well as test and launch a long-term revenue. This model will not be built on selling data (see Openness below), but rather based on selling value-added services and service level agreements. We’ve got experience with this approach: we’re funding Unpaywall this way, and it’s been both open and fully self-sustaining for several years now.

Openness

We’re passionate about openness. It’s the “Our” in our name–we think research should belong to all of us, as humans.  Openness is the first of core values, and it’s a big piece of our recent public commitment to the Principles of Open Scholarly Infrastructure (POSI). A lot of our excitement about OpenAlex comes from the chance to make this rich dataset unprecedentedly open.  Specifically:

  • The code will be open source under an MIT license, hosted on our GitHub account and backed up by Software Heritage.
  • The data will be as openly licensed as possible. Some of the data consists of facts, which have no copyright (see this Crossref post for more about that idea). Where copyright is applicable, and where we have the option, we’ll apply the CC0 waiver. Where other rightsholders are involved, we will encourage them to allow a similarly open license.
  • The data will be free, as in no cost. It will be available via a free API (more details below) with generous limits, as well as periodic data dumps available at no charge (we may require the downloader to cover the 3rd party data transfer fees if these get heavy).

Data we are losing (at least to start)

As mentioned in our initial announcement, OpenAlex will be missing some data that MAG currently has–particularly at our launch in 2021, due to the very tight timeline. More accurately, we’ll have this data, but won’t be keeping it up to date. Specifically we won’t have: 

  • Conference Series and Conference Instances. Importantly, we’ll continue to bring in the vast majority of conference papers. But won’t be keeping track of  new conferences themselves (eg, The 34th Annual Conference of Foo), and with that the ability to link conference papers to those conferences.
  • Citation Contexts (the full text of the paragraph where each citation originally appeared)
  • Most abstracts. We will however probably have those (minority of) abstracts that publishers send Crossref or PubMed for redistribution.
  • Full coverage of DOI-unassigned works:. MAG is particularly good at finding scholarly papers without a DOI. We’ll be less good, especially at first. We will include many DOI-unassigned works…just not as many as MAG.

There is some other data that may or may not make it into OpenAlex by December 2021. We are still testing these for feasibility:

  • Patents
  • Paper recommendations

Data we are adding

Although we’ll be missing some data, we’ll also be bringing some new data to the party — stuff MAG doesn’t have right now. Specifically will include:

  • The Open Access status of papers (via the Unpaywall dataset, which has become the industry standard). We’ll be able to tell you whether a given paper is OA or not, its copyright license, and where to find it. 
  • A more comprehensive list of ISSNs associated with each journal, including the ISSN-L, which is helpful for deduplicating journals.
  • ORCID for author clusters. To start with, this will just be in unambiguous cases, when assignment is clear via the Crossref and ORCID datasets. Over time we may apply fancier, more inferential assignments.
  • ROR IDs for institutions, in addition to GRIDs

Over the long term, our goal with OpenAlex is to create a truly comprehensive map of the global scholarly conversation, so we’ll be continually looking to  expand and enhance the data it includes.

Data dumps

There will be (at least) two ways to get at the data: data dumps, and the API (below). 

Data dumps will be in the same table/column format as the MAG data, so that the downloads can be a drop-in replacement. There may be some additional tables and additional columns for new data we’re adding, and some data values will be missing (both of these are described above), but if you’re running code to ingest MAG dumps right now, you’ll be able to run pretty much the same code to ingest OpenAlex dumps in December. That’s a really important part of this project for us, because we know it will save a lot of folks a lot of time.

We will release new data dumps every 2 weeks, as either a full dump or an incremental update or both (we’re still looking into that). The data will likely be hosted on AWS S3 rather than Microsoft Azure.

API 

The other way main to get at the data will be via the API. Here we will be doing it pretty differently than Microsoft. We will not be supporting the Microsoft Academic Knowledge API or Microsoft Academic Knowledge Exploration Service (MAKES). Instead, we will host a central, open REST API that anyone can query. This API will have two kinds of endpoints: entity endpoints, and slice-and-dice endpoints. Both will be read-only (GET), deliver data in JSON format, and be rate-limited but with high rate-limits. 

  • Entity endpoints will let you quickly retrieve a specific scholarly entity (eg paper, person, journal, etc) by its ID. Signatures will look like  /doi/:doi and /journal/:issn.
  • Slice-and-dice endpoints will let you query the data with filters to return either item lists, or aggregate group counts. An example call might look something like /query?filter=issn:2167-8359,license:cc-by&groupby=year (that would give you the annual counts of CC-BY-licensed articles from the journal PeerJ). You could also use the slice-and-dice endpoints to do things like build a faceted scholarly search engine, or an evaluation tool.

Timeline

We appreciate that having a scheduled beta (or alpha!) release of the API and data dump would be very helpful. And we further realize that the sooner we can let you know that schedule, the better. Unfortunately, we don’t know the timeline for these releases yet. Our current best guess is the early fall. We’ll certainly be doing our best to get something pushed out there as soon as possible. We encourage you to  join the mailing list so we can keep you up to date. 

Your comments

Finally, we welcome your comments and questions! We’ve gotten oodles of helpful feedback already, and we really appreciate that. We’re especially interested in getting your current use-case for MAG…we’re working to prioritize supporting those cases, first and foremost.  You can do that via our community survey here, or drop us a line at team@ourresearch.org.

Open Science nonprofit OurResearch receives $4.5M grant from Arcadia Fund

OurResearch, a nonprofit seeking to speed the global adoption of Open Science, announced today that it had been awarded a new 3-year, $4.5M (USD) grant from the UK-based Arcadia, a charitable fund of Lisbet Rausing and Peter Baldwin.

The grant, which follows an 2018 award for $850,000, will help expand two existing open-source software projects, as well as support the launch of two new ones:

  • Unpaywall, launched in 2017, has become the world’s most-used index of Open Access (OA) scholarly papers. The free Unpaywall extension has 400,000 active users, and its underlying database powers OA-related features in dozens of other tools including Web of Science, Scopus, and the European Open Science Monitor. All Unpaywall data is free and open.
  • Unsub is an analytics dashboard that helps academic libraries cancel their large journal subscriptions, freeing up money for OA publishing. Launched in late 2019, Unsub is now used by over 500 major libraries in the US and worldwide, including the national library consortia of Canada, Australia, Greece, Hong Kong, and the UK. 
  • JournalsDB will be a free and open database of scholarly journals. This resource will gather a wide range of data on tens of thousands of journals, emphasizing coverage of emerging open venues. 
  • OpenAlex will be a free and open bibliographic database, cataloging papers, authors, affiliations, citations, and journals. Inspired by the ancient Library of Alexandria, OpenAlex will strive to create a comprehensive map of the global scholarly conversation.  In a recent blog post, the team announced that OpenAlex will be released in time to serve as a replacement for Microsoft Academic Graph, whose discontinuation was also recently announced.

OurResearch’s ongoing operations costs (about $1M annually) are currently covered by earned revenue from service-level agreements. The new funding will go toward accelerating development of new features and tools.

The new tools and features will be developed in keeping with OurResearch’s longstanding commitment to openness. OurResearch recently became one of the first to commit to the Principles of Open Scholarly Infrastructure (POSI), a set of guidelines encouraging openness, sustainability, and responsive governance. OurResearch has always fully shared its source code and datasets, and maintains a transparency webpage publishing salaries, tax filings, and other information. The proposal for this grant is itself shared on Open Grants.

“We are very grateful to the Arcadia Foundation for this grant, which will help us innovate more quickly than ever before. There is an urgent need for open scholarly infrastructure,” said Heather Piwowar, one of OurResearch’s two cofounders. 

“Since our beginning at a hackathon ten years ago, we’ve been working to build sustainable, open, community-oriented software tools to make research more open,” added her cofounder Jason Priem. “We’re so excited about the ways this grant will help us further that vision.” 

Work on the grant is expected to begin at once, with early versions of both JournalsDB and the OpenAlex launching later this year.

———————————-

OurResearch is a nonprofit that builds tools to help accelerate the transition to universal Open Science. Started at a hackathon in 2011, they remain committed to creating open, sustainable research infrastructure that solves real-world problems.

Arcadia is a charitable fund of Lisbet Rausing and Peter Baldwin. It supports charities and scholarly institutions that preserve cultural heritage and the environment. Arcadia also supports projects that promote open access and all of its awards are granted on the condition that any materials produced are made available for free online. Since 2002, Arcadia has awarded more than $777 million to projects around the world.