About BITIG Data

What This Is

BITIG Data is a curated digital showcase — not a complete dictionary, not a search engine, not a replacement for scholarly editions. It demonstrates that Old Uyghur materials can be:

Current Limitations

Limitation | Why
No Old Uyghur input method | The Old Uyghur Unicode block (U+10F70–U+10FAF) was only standardized in 2021. No operating system or third-party IME supports it yet.
No audio / TTS | Old Uyghur is a dead language (9th–14th century). There are no native speaker recordings and no speech synthesis models.
Curated samples only | The first edition includes ~5 manuscript texts and ~20 dictionary entries as proof of concept. It is not a comprehensive corpus.
Latin transcription search only | Without an Old Uyghur input method, search relies on Latin transcription. Browse-by-letter provides an alternative.
Placeholder images | Manuscript images require permission from holding institutions (Berlin-Brandenburg Academy, National Library of China, etc.). Placeholders are used pending authorization.
All data needs source audit | Every entry and segment must be traceable to a published scholarly source. Entries marked "needs source" are provisional.
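
The search-related limitations above can be illustrated with a minimal Python sketch. It is not the site's actual code: the entry layout (a "transcription" field), the sample words, and the function names are assumptions made for illustration. It shows a range check against the Old Uyghur block named above, a diacritic-insensitive search over Latin transcriptions, and a browse-by-letter index.

```python
import unicodedata
from collections import defaultdict

# Old Uyghur Unicode block, as given in the limitations list.
OLD_UYGHUR_START = 0x10F70
OLD_UYGHUR_END = 0x10FAF

# Hypothetical entries; the "transcription" field name is an assumption.
ENTRIES = [
    {"transcription": "bitig"},
    {"transcription": "täŋri"},
    {"transcription": "bilgä"},
]


def is_old_uyghur(ch: str) -> bool:
    """Return True if a single character lies in the Old Uyghur block."""
    return OLD_UYGHUR_START <= ord(ch) <= OLD_UYGHUR_END


def fold(text: str) -> str:
    """Casefold and drop combining marks so that 'ä' also matches 'a'."""
    decomposed = unicodedata.normalize("NFD", text.casefold())
    return "".join(c for c in decomposed if not unicodedata.combining(c))


def search(entries, query):
    """Substring search over folded Latin transcriptions."""
    needle = fold(query)
    return [e for e in entries if needle in fold(e["transcription"])]


def browse_by_letter(entries):
    """Group entries by the folded first letter of their transcription."""
    index = defaultdict(list)
    for entry in entries:
        index[fold(entry["transcription"])[:1]].append(entry)
    return dict(index)
```

Folding away diacritics makes the search lenient for users who cannot type characters such as ä, at the cost of conflating letters that the transcription keeps distinct. A stricter search would compare casefolded strings without removing combining marks.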

Roadmap

  1. Short term: expand curated samples to 10+ texts; grow dictionary to 100+ entries with verified sources.
  2. Medium term: add interlinear glossing (morpheme-by-morpheme analysis); integrate with existing digital catalogues (Berlin Turfan Archive, IDP).
  3. Long term: build a community-editable transcription platform; develop Old Uyghur OCR assistance tools; establish a peer-review pipeline for data quality.

Technical

This site is a collection of static HTML pages with JSON data files. No backend, no database, no JavaScript framework. Hosted on Cloudflare Pages. All source data is stored in plain JSON files under data/. The site can be downloaded, modified, and redeployed by anyone.
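
Because the data is plain JSON, a source audit can be run offline with a short script. The following is a sketch under stated assumptions, not the project's tooling: it assumes each file under data/ holds a JSON array of objects, and that each object has hypothetical "id" and "source" fields, where a missing, empty, or "needs source" value marks the entry as provisional.

```python
import json
from pathlib import Path


def provisional_entries(data_dir):
    """Yield (file name, entry id) for entries without a verified source.

    Assumes every *.json file contains a list of objects with optional
    "id" and "source" keys; both key names are hypothetical.
    """
    for path in sorted(Path(data_dir).glob("*.json")):
        with path.open(encoding="utf-8") as fh:
            entries = json.load(fh)
        for entry in entries:
            source = (entry.get("source") or "").strip()
            if not source or source.lower() == "needs source":
                yield path.name, entry.get("id")


if __name__ == "__main__":
    # Run from the site root so that "data" resolves to the data/ folder.
    for file_name, entry_id in provisional_entries("data"):
        print(f"{file_name}: {entry_id} needs a source")
```

Run from a downloaded copy of the site, this prints one line per provisional entry, which gives a simple checklist for the source audit.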

Built for academic demonstration at the Old Uyghur Studies Conference, 2026.

Contact

[Contact placeholder — to be filled before conference]