XRef

XML — eXtensible Markup Language — is the structural language of modern scholarly publishing. When a published article exists as XML, every element of the record is individually labeled and machine-readable: the title, each author's name and affiliation, the abstract, every section, every table cell, every reference, every figure caption, every inline citation.

A Word document or PDF is designed for human reading. A search engine, an indexing database, a citation tracker, or a full-text reader cannot reliably tell where the abstract ends and the introduction begins — or who the corresponding author is — without structured markup. XML closes that gap.

It converts a document that looks like a published article into one that behaves like a published article across every platform that consumes it: PubMed, PMC, institutional repositories, reference managers, and the journal's own full-text reader.

For journals that want to be indexed, discoverable, and preserved for the long term, XML is the format that makes everything else possible. XRef produces it without changing a word of the published record.

Assertion	What it checks	Why it matters
A1 / A3	Every inline citation marker is linked to a reference entry via `<xref>`	Plain-text citation markers break reference links in every downstream reader
A4 / A5	Every `<table-wrap>` has a label and caption; labels match source numbering	Mislabeled tables cannot be cross-referenced or cited
A10	Every `rid` attribute resolves to a matching `id`	Broken cross-references fail silently in validators but visibly in readers
A24	Every URL in the reference list is an `<ext-link>`, not plain text	Plain-text URLs cannot be resolved by reference managers or DOI resolvers
A27	At least one ISSN is present in `<journal-meta>`	Missing ISSN blocks PubMed and Crossref from resolving the journal record

The published article, fully structured

Structural assertions

JATS XML specimen

Why XML is the language of scholarly publishing

Two formats. Every scholarly destination.

Full-Text XML for Archives and Readers

Citation XML for Indexing and Discovery

What structured XML makes possible

Discoverability

PubMed and MEDLINE indexing

Long-term preservation

Reference linking

Accessibility

Platform independence

What XRef converts

Word → JATS Full-Text XML

Word → PubMed Citation XML

PDF → JATS Full-Text XML

PDF → PubMed Citation XML

From published file to validated XML

Upload the article

Verify metadata

Convert and structure

Validate and deliver

Every article passes 27 structural assertions

One scholarly quality platform, separate product surfaces

CheckRef

MetaRef

RefLens

ScholaRef

What publishers ask first

Convert your first article today