Methodology

This page documents how each translation on the site is produced, evaluated, and labelled. It is the public companion to the Translation Status Contract in the repository.

How we translate

The pipeline is Sanskrit-grounded, not anchor-grounded. For each verse, the translation model receives the Devanāgarī text, an IAST transliteration, the Vidyut morphological segmentation (lemmas, case, number, gender, sandhi splits), Cologne C-SALT glosses per lemma from Monier-Williams and Apte, the Skrutable meter tag, and the preceding two verses for citation-aware context. A public-domain English translation may be passed as a reference signal — explicitly marked as such — but is never the source of truth. When the English anchor and the Sanskrit morphology disagree, the model is instructed to trust the Sanskrit.

The LLM-as-judge

Every draft is scored 1–10 for Sanskrit fidelity by a second model running the same input bundle (Devanāgarī + morphology + glosses + meter + context). The judge returns a score, a rationale, and a list of concerns; these are stored alongside the translation row.

Score ≥ 7 — promote from draft to published; visible on the site behind the AI · not verified badge.
Score ≥ 8 — flagged high-confidence (a subtle badge variant lands in V1.x).
Score < 7 — stays draft; excluded from public display until revised or human-reviewed.

Calibration is checked quarterly by human spot-audit of 50 random published translations per language. Audit outcomes feed back into prompt versions, which are themselves recorded per translation row (prompt_version) so any score is reproducible.

AI-assist badge legend

Each translation carries a badge inline with its verse number. The badge state is derived from two fields: ai_assisted (boolean) and status (draft | published | reviewed). The full state table — including badge colours, public-display rules, and the V1.1 five-state expansion — lives in the Translation Status Contract . License terms for the translated text itself are documented on the License page.

Why no commentary is glued under verses

The classical auto-commentaries — Kṣemarāja's Śivasūtravimarśinī on the Śiva Sūtras, Abhinavagupta's Tantrāloka-viveka, Jayaratha's commentary on the Tantrāloka — are full texts in their own right. They are not annotations to be folded under the base sūtras. We ingest each one as a sibling text under the same verse-anatomy schema. The Vimarśinī, for example, lives at its own route (/trika/siva-sutras-vimarsini), not nested under the Śiva Sūtras. Readers who want both can open them side by side; the structural hierarchy of the site does not imply that the commentary is subordinate.

Public-domain English anchors

Pre-1930 translations by John Woodroffe (Arthur Avalon), Ralph Griffith, George Thibaut, Max Müller, W. D. Whitney, and J. H. Woods are present in the pipeline as reference signals only. They are never the ground truth, never displayed unattributed, and never used to override Sanskrit morphology when the two conflict. Full per-translator attribution lives on the Sources page.

Editorial policy

Per-text source-of-truth is recorded in the texts.source_revision column — the exact GRETIL or Muktabodha or sanskritdocuments.org revision we ingested, with date. Critical-edition emendation (proposing variant readings, reconciling manuscript traditions) is out of scope for V1; we publish the source revision as-is and link the upstream so scholars can verify. V1.x will add a variants column for recorded alternate readings without committing to an emended text.

Reviewer policy

Human reviewers — when V1.x opens the review pipeline — work from the Sanskrit and morphology, never from copyrighted modern translations. Pasting from Jaideva Singh, Mark Dyczkowski, Lilian Silburn, Alexis Sanderson, or any other in-copyright translator is not permitted. Reviewers may consult those works for orientation; the accepted English (or other-language) wording must be the reviewer's own and grounded in the source.

Known limitations

V1 ships AI-only translations behind explicit badges. No translation on the public site is human-verified yet.
The judge has not been audited at scale; calibration is best-effort until the first quarterly spot-check.
Coverage is uneven — some texts have richer Cologne lemma coverage than others; the model receives whatever the corpus offers.
Eleven Indic target languages are planned; only English is published at launch (per the language picker, others are explicitly marked soon).

Last revised: 2026-05-31 · Source: STATUS-CONTRACT.md and the V1 plan document.