Crossref Events

When someone links their data online, or mentions research on a social media site, we capture that event and make it available for anyone to use in their own way. We provide the unprocessed data—you decide how to use it.

Before the expansion of the Internet, most discussion about scholarly content stayed within scholarly content, with articles citing each other. With the growth of online platforms for discussion, publication and social media, we have seen discussions extend into new, non-traditional venues. Crossref Event Data captures this activity and acts as a hub for the storage and distribution of this data. An event may be a citation in a dataset or patent, a mention in a news article, Wikipedia page or on a blog, or discussion and comment on social media.

When someone links their data online, or mentions research on, for example, Twitter, Wikipedia, or Reddit, Crossref’s uses a set of APIs to captures and records those events in their ‘Event dataset’. Events are tracked via their DOI and URLs, which enables Crossref to monitor where it’s been shared, linked, bookmarked, referenced or commented on. Crossref Event Data currently contains events from a range of data sources, including Crossref Metadata, DataCite Metadata, F1000Prime (Recommendations of research publications, Hypothes.is, The Lens (Cambia), Newsfeed, Reddit, Reddit Links, Stack Exchange Network, Twitter, Wikipedia, and Wordpress.com

See the crossref events page, and data details, for more information.

The corresponding table created in BigQuery are crossref.crossref_events and crossref.crossref_events_partitions.

Summary

Average runtime

2 hours

Average download size

10 GB

Harvest Type

API

Harvest Frequency

Weekly

Runs on remote worker

True

Catchup missed runs

False

Table Write Disposition

Append

Update Frequency

Daily

Credentials Required

No

Uses Telescope Template

Stream

Latest schema

name

type

mode

description

id

STRING

REQUIRED

Unique ID for the Event.

subj_id

STRING

NULLABLE

Subject persistent ID.

relation_type_id

STRING

NULLABLE

Type of the relationship between the subject and object.

obj_id

STRING

NULLABLE

Object persistent ID.

timestamp

TIMESTAMP

REQUIRED

Timestamp of when the Event was created.

occurred_at

TIMESTAMP

REQUIRED

Timestamp of when the Event is reported to have occurred.

experimental

BOOL

NULLABLE

total

INTEGER

NULLABLE

source_id

STRING

REQUIRED

A name for the source.

source_token

STRING

NULLABLE

Unique ID that identifies the Agent that generated the Event.

terms

STRING

NULLABLE

Terms of use for using the API at the point that you acquire the Event.

license

STRING

NULLABLE

A license under which the Event is made available.

evidence_record

STRING

NULLABLE

Link to an Evidence Record for this Event.

subj

RECORD

NULLABLE

Subject metadata.

subj.pid

STRING

NULLABLE

The persistent ID. Must correspond to ‘subj_id’ or ‘obj_id’

subj.issued

TIMESTAMP

NULLABLE

Publication date.

subj.title

STRING

NULLABLE

The title of the webpage, comment, etc.

subj.author

RECORD

REPEATED

Author of the comment, blog etc.

subj.author.url

STRING

NULLABLE

subj.author.name

STRING

NULLABLE

subj.author.id

STRING

NULLABLE

subj.url

STRING

NULLABLE

URL where this was found. May be different to ‘pid’

subj.alternative_id

STRING

NULLABLE

subj.original_tweet_author

STRING

NULLABLE

subj.original_tweet_url

STRING

NULLABLE

subj.type

STRING

NULLABLE

subj.work_type_id

STRING

NULLABLE

subj.work_subtype_id

STRING

NULLABLE

subj.jurisdiction

STRING

NULLABLE

subj.api_url

STRING

NULLABLE

subj.publisher

RECORD

REPEATED

subj.publisher.url

STRING

NULLABLE

subj.publisher.name

STRING

NULLABLE

subj.publisher.id

STRING

NULLABLE

subj.publisher.type

STRING

NULLABLE

subj.json_url

STRING

NULLABLE

subj.name

STRING

NULLABLE

subj.datePublished

STRING

NULLABLE

subj.registrantId

STRING

NULLABLE

subj.dateModified

TIMESTAMP

NULLABLE

subj.id

STRING

NULLABLE

subj.proxyIdentifiers

STRING

NULLABLE

subj.funder

RECORD

NULLABLE

subj.funder.id

STRING

NULLABLE

subj.funder.type

STRING

NULLABLE

subj.funder.name

STRING

NULLABLE

subj.issueNumber

STRING

NULLABLE

subj.periodical

RECORD

NULLABLE

subj.periodical.id

STRING

NULLABLE

subj.periodical.issn

STRING

NULLABLE

subj.periodical.type

STRING

NULLABLE

subj.periodical.name

STRING

NULLABLE

subj.pagination

STRING

NULLABLE

subj.version

STRING

NULLABLE

subj.volumeNumber

STRING

NULLABLE

subj.includedInDataCatalog

RECORD

NULLABLE

subj.includedInDataCatalog.id

STRING

NULLABLE

subj.includedInDataCatalog.type

STRING

NULLABLE

subj.includedInDataCatalog.name

STRING

NULLABLE

obj

RECORD

NULLABLE

Object metadata.

obj.pid

STRING

NULLABLE

obj.url

STRING

NULLABLE

obj.method

STRING

NULLABLE

obj.verification

STRING

NULLABLE

obj.work_type_id

STRING

NULLABLE

obj.publisher

RECORD

REPEATED

obj.publisher.url

STRING

NULLABLE

obj.publisher.name

STRING

NULLABLE

obj.publisher.id

STRING

NULLABLE

obj.publisher.type

STRING

NULLABLE

obj.name

STRING

NULLABLE

obj.datePublished

STRING

NULLABLE

obj.registrantId

STRING

NULLABLE

obj.dateModified

TIMESTAMP

NULLABLE

obj.id

STRING

NULLABLE

obj.proxyIdentifiers

STRING

NULLABLE

obj.author

STRING

NULLABLE

obj.type

STRING

NULLABLE

obj.funder

RECORD

NULLABLE

obj.funder.id

STRING

NULLABLE

obj.funder.type

STRING

NULLABLE

obj.funder.name

STRING

NULLABLE

obj.issueNumber

STRING

NULLABLE

obj.periodical

RECORD

NULLABLE

obj.periodical.id

STRING

NULLABLE

obj.periodical.issn

STRING

NULLABLE

obj.periodical.type

STRING

NULLABLE

obj.periodical.name

STRING

NULLABLE

obj.pagination

STRING

NULLABLE

obj.version

STRING

NULLABLE

obj.volumeNumber

STRING

NULLABLE

obj.includedInDataCatalog

RECORD

NULLABLE

obj.includedInDataCatalog.id

STRING

NULLABLE

obj.includedInDataCatalog.type

STRING

NULLABLE

obj.includedInDataCatalog.name

STRING

NULLABLE

updated

STRING

NULLABLE

will have a value of ‘deleted’ or ‘edited’

updated_reason

STRING

NULLABLE

optional, may point to an announcement page explaining the edit

updated_date

TIMESTAMP

NULLABLE

ISO8601 date string for when the event was updated

message_action

STRING

NULLABLE

action

STRING

NULLABLE

jwt

STRING

NULLABLE