Stephanos Pipeline Progress

Last updated: 2026-04-12 20:10:53

Stage Done Total Progress % Rate ETA
AI Translation 1,035 3,569
29.0% 140/week 4.2 months
Human Review 91 1,035
8.8% 44/week 5.0 months
Proper Noun Extraction 1,035 1,035
100.0% 0/week complete
Structured Source Citations 936 936
100.0% 156/week complete
Wikidata Sources 408 408
100.0% 0/week complete
Etymology Extraction 1,035 1,035
100.0% 0/week complete
Alias Extraction 1,327 1,327
100.0% 0/week complete
Spelling Variants 9,162 9,162
100.0% 0/week complete
Human Entity Review 4,156 9,306
44.7% 0/week stalled
Wikidata Places 2,296 3,083
74.5% 70/week 2.6 months
Meineke Difference Analysis 1,212 2,001
60.6% 140/week 1.3 months
Text Pair Sync 3,569 3,569
100.0% 3,569/week complete
Translation Risk Sync 1,035 1,035
100.0% 1,035/week complete
nodegoat Sync 3,560 3,569
99.7% 0/week stalled
Note: ETA estimates are based on the processing rate over the past 7 days. "Stalled" means no progress in the last week. Proper-noun and etymology extraction are measured over translated lemmas, matching the nightly jobs. Wikidata stages count rows once they have a recorded outcome, including not-found, ambiguous, and human-reviewed cases. Retired stages such as finished Billerbeck OCR are intentionally omitted.

← Back to main site | Statistics | Downloads