[go: up one dir, main page]

Our APIs include 100M+ pages across 850+ Wikimedia projects and languages

Wikimedia Enterprise APIs

Snapshot API

Retrieve entire Wikimedia projects as a database dump file.

  • Download a compressed file containing everything in any project, in any language
  • Article body in HTML as well as Wikitext
  • Free: New snapshots every two weeks
  • Upgrade: Daily snapshots cadence
  • Upgrade: Beta Structured Contents Snapshot with parsed infoboxes and article sections
On-demand API

Retrieve single articles from any Wikimedia project at anytime.

  • Make standard HTTP requests to retrieve documents by ID or name from all projects and languages, or use filters to limit response
  • Request the latest page data anytime to augment your Realtime or Snapshot API data
  • Free: 5,000 requests monthly
  • Free: Infobox content and full article sections in parsed JSON with the beta Structured Contents endpoint
Realtime API

Stream updates in real-time from any Wikimedia project.

  • Streaming: Receive streaming updates (firehose) of every change as they occur in real-time
  • Batch: Download compressed snapshot files of incremental updates every hour
  • Instant updates for new content, any edits, deletions, and breaking news events including community-curated visibility changes
  • Contact sales for pricing

Gain instant access to Snapshot API and On-demand API with a free account.

Wikimedia Enterprise API Structured Data

All of our APIs return the same structured JSON (or ND-JSON) response format making it easy to augment one API with another.

Three APIs; same data, different retrieval methods.
_ Retrieve bulk data with the Snapshot API
_ Receive changes instantly with Realtime API streaming
_ Retrieve single articles with the On-demand API

API responses include article data such as summary, image, Wikidata QID, license, and more. Also included is data specific to the last revision, such as editor, size of change, and credibility score with revert probability.

Example: Run this cURL command with your access token (see docs to get that) to grab the live english squirrel wikipedia article seen here →

curl -H "Authorization: Bearer ACCESS_TOKEN" -L "https://api.enterprise.wikimedia.com/v2/articles/Squirrel?limit=1&fields=name&fields=identifier&filters=\{\"field\":\"is_part_of.identifier\",\"value\":\"enwiki\"\}"

For a full breakdown and explanation of the data schema in responses see our Data Dictionary.

More questions? – We’re here to help.

{
  "name": "Squirrel",
  "identifier": 28492,
  "abstract": "...summary of article...",
  "date_created": "2001-10-09T17:55:09Z",
  "date_modified": "2023-06-08T22:54:56Z",
  "date_previously_modified": "2023-05-11T22:49:51Z",
  "version": {
    "identifier": 1159205766,
    "comment": "...version edit comment...",
    "tags": [...],
    "is_minor_edit": true,
    "has_tag_needs_citation": true,
    "scores": {...},
    "editor": {...},
    "number_of_characters": 30705,
    "size": {
      "value": 30799,
      "unit_text": "B"
    },
    "event": {...}
  },
  "previous_version": {...},
  "url": "https://en.wikipedia.org/wiki/Squirrel",
  "watchers_count": 341,
  "image": {
    "content_url": "https://upload.wikimedia.org/wikipedia/commons/6/68/Sciuridae.jpg",
    "width": 600,
    "height": 600
  },
  "article_body": {
    "html": "...html of entire page...",
    "wikitext": "...wikitext of entire page..."
  },
  "namespace": {
    "identifier": 0
  },
  "in_language": {
    "identifier": "en"
  },
  "main_entity": {
    "identifier": "Q9482",
    "url": "https://www.wikidata.org/entity/Q9482"
  },
  "protection": [...],
  "additional_entities": [...],
  "categories": [...],
  "templates": [...],
  "redirects": [...],
  "is_part_of": {...},
  "license": [
    {
      "name": "Creative Commons Attribution Share Alike 3.0 Unported",
      "identifier": "CC-BY-SA-3.0",
      "url": "https://creativecommons.org/licenses/by-sa/3.0/"
    }
  ],
}

SLA and Support

Contracted accounts receive 99% SLA and support response time guarantees. All accounts have access to our help center faqs.

Open License

Benefit from open-licensed, royalty-free data. Every response includes license info.