Statistical Analysis - Overview

Generated: 2026-05-28 21:13:21 UTC

This site provides comprehensive statistical analysis of Stephanos of Byzantium's Ethnika. Select a section below to explore different aspects of the text.

1. Word Count Statistics

Explore word count distributions by entry type and starting letter. Includes normalized histograms with KDE curves and statistical tests comparing different entry types.

2. Translation Length Vocabulary

Identify Greek and English words associated with longer or shorter English translations after controlling for Greek source length.

3. Stephanos vs Epitomizer Emphasis

Discover what the original Stephanos emphasized versus what the Byzantine epitomizer emphasized. Interactive visualizations reveal what was lost in the epitome and what was added or expanded.

4. Etymology Analysis

Examine the distribution of etymology categories across the corpus, with comparisons between Delta and Non-Delta entries.

5. Parisinus Coislinianus 228 vs Epitomised version Comparison

Statistical comparison of word counts between entries from the original Stephanos (Delta) and the Byzantine epitome (Non-Delta).

6. Analysis by Category

Detailed analysis of how different categories of proper nouns correlate with entry length. Explore which authors, historical figures, places, ethnic groups, and deities Stephanos emphasized.

Authors Historical Figures Places Ethnic Groups Deities

7. Pausanias Citations

Analysis of Stephanos's citations of Pausanias the Periegete. Did Stephanos have access to the complete text of Pausanias, or only certain portions? Statistical analysis of citation distribution with links to the cited passages.

8. Guidance Rule Statistics

Daily statistics for translation-guidance rules: discovery estimates, Zipf-like rank frequency, and top-rule headword coverage.