Stephanos Pipeline Progress

Last updated: 2026-04-16 20:34:38

Stage                        Done   Total  Progress %  Rate        ETA
AI Translation               1,035  3,569  29.0%       140/week    4.2 months
Human Review                 118    1,035  11.4%       35/week     6.1 months
Proper Noun Extraction       1,035  1,035  100.0%      0/week      complete
Structured Source Citations  936    936    100.0%      219/week    complete
Wikidata Sources             408    408    100.0%      0/week      complete
Etymology Extraction         1,035  1,035  100.0%      0/week      complete
Alias Extraction             1,327  1,327  100.0%      0/week      complete
Spelling Variants            9,162  9,162  100.0%      0/week      complete
Human Entity Review          4,156  9,306  44.7%       0/week      stalled
Wikidata Places              2,328  3,083  75.5%       70/week     2.5 months
Meineke Difference Analysis  1,292  2,001  64.6%       140/week    1.2 months
Text Pair Sync               3,569  3,569  100.0%      3,569/week  complete
Translation Risk Sync        1,035  1,035  100.0%      1,035/week  complete
nodegoat Sync                3,560  3,569  99.7%       0/week      stalled

Note: ETA estimates are based on the processing rate over the past 7 days. "Stalled" means no progress in the last week. Proper-noun and etymology extraction are measured over translated lemmas, matching the nightly jobs. Wikidata stages count rows once they have a recorded outcome, including not-found, ambiguous, and human-reviewed cases. Retired stages such as finished Billerbeck OCR are intentionally omitted.
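
As an illustration of the ETA rule described in the note, here is a minimal Python sketch. The function name eta_label is hypothetical, and the 30-day month is an assumption: it happens to reproduce the figures shown on this page, but the pipeline's actual conversion is not documented here.

```python
def eta_label(done: int, total: int, rate_per_week: float) -> str:
    """Return an ETA label from item counts and the 7-day processing rate.

    Assumptions (not confirmed by the page): a month is treated as 30 days,
    and a stage with nothing left to do is "complete" regardless of its rate.
    """
    remaining = total - done
    if remaining <= 0:
        return "complete"
    if rate_per_week <= 0:
        # No progress in the last week and work still outstanding.
        return "stalled"
    days = remaining / rate_per_week * 7
    return f"{days / 30:.1f} months"


# Example rows taken from the table on this page.
print(eta_label(1035, 3569, 140))  # AI Translation
print(eta_label(118, 1035, 35))    # Human Review
print(eta_label(3560, 3569, 0))    # nodegoat Sync
```

Checking completion before the rate matters: a finished stage such as Structured Source Citations can still show a nonzero weekly rate, while an unfinished stage with a rate of zero is reported as stalled.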
