Guidance Rule Statistics

Generated: 2026-06-15 21:03:31 UTC

This page measures how the translation-guidance rules are behaving as reusable editorial knowledge. The denominators are explicit: scan coverage is based on headwords with guidance scan evidence, and rule coverage is based on active, non-retired rules with matched evidence.

Rules: 180
Scanned headwords: 3,570
Headwords with active rule matches: 1,112
Active rule incidences: 5,296

Introduction Date Basis

Rules imported from Gabe's spreadsheet can predate their database row. Estimated dates use the earliest Gabe-linked kappa human translation whose headword is matched by that rule; rules without that evidence are kept separate from actual timestamps.

Date basisRules
actual28
estimated62
unknown90

Estimated Rules by Introduction Date

DateNew estimated rulesCumulative estimated rules
2026-02-15 3 3
2026-02-19 10 13
2026-02-26 5 18
2026-02-27 2 20
2026-03-02 22 42
2026-03-12 2 44
2026-03-26 4 48
2026-04-02 2 50
2026-04-08 1 51
2026-04-23 10 61
2026-05-12 1 62

Butterfly Collecting Estimate

Here each guidance rule is treated like a species and each scanned headword is a collecting site. Rules that appear only once or twice drive the estimate of how many additional rules are likely still unseen.

Observed active firing rules: 99
Chao2 unseen rules: 5.00
Chao2 estimated total: 104.00

Chao2 currently estimates 5.00 unseen active firing rules beyond the 99 active rules already observed in guidance-match evidence.

Kappa Rule Accumulation

This uses the dated kappa human translations as the collecting sequence. Each step is one Gabe-linked kappa headword in timestamp order. The upper curve shows cumulative distinct active rules first observed in that sample; the lower panel shows new-rule bursts and the number of new rules seen in the trailing 10 sampled entries. The current sample observes 95 distinct active rules across 98 dated kappa entries.

MeasureValue
Observed active rules that fire at least once99
Rule incidences in scanned headwords5296
Rules seen in one headword10
Rules seen in two headwords10
Estimated total rules, Chao2104.00
Estimated unseen rules, Chao25.00
Estimated total rules, first-order jackknife109.00
Estimated total rules, second-order jackknife109.00
Sample coverage99.8%
Chance next rule incidence is a not-yet-seen rule0.2%

Estimate by Rule Kind

Kind Active rules Observed firing rules Singletons Doubletons Chao2 total Chao2 unseen Sample coverage
gloss 70 66 8 10 69.20 3.20 99.3%
formula 32 31 0 0 31.00 0.00 100.0%
proper_noun 75 2 2 0 3.00 1.00 0.0%

Zipf-Like Rank Frequency

Rules are ranked by the number of distinct headwords where they fire. A Zipf-like distribution would appear close to a straight line on the log-log plot, with slope near -1.

Fitted slope: -1.775
Alpha: 1.775
Log-log R²: 0.90
Slope p-value: 8.616e-51

Top-N Headword Coverage

This asks how much of the scanned corpus is covered if we keep only the most frequently firing rules.

Top N rules Covered headwords Coverage of scanned headwords Coverage of matched headwords Coverage of rule incidences
1 531 14.9% 47.8% 10.0%
3 751 21.0% 67.5% 27.4%
5 1,026 28.7% 92.3% 38.8%
10 1,076 30.1% 96.8% 60.3%
20 1,101 30.8% 99.0% 83.9%
50 1,112 31.1% 100.0% 96.5%

Top Firing Rules

Rank Kind Rule Preferred Matched headwords Occurrences Date basis Introduced
1 formula τὸ ἐθνικὸν + X (nominative ETHNONYM) Translate as "the ethnonym is 'X'" 531 771 unknown 2026-04-24
2 formula X (SETTLEMENT) + Y (genitive REGION) Translate as "a X in Y" 520 710 unknown 2026-04-24
3 formula X (SETTLEMENT) + Y (genitive PEOPLE) Translate as "a X of the Y" 399 511 unknown 2026-04-24
4 gloss χώρα ἡ region; territory (when belonging to a specific people, tribe, or nation); surrounding territory (when juxtaposed against a πόλις); land (remote or poetical places) 323 438 estimated 2026-02-26
5 formula X (nominative) + ὡς + Y (nominative HOMOMORPH) Translate as "'X' as in 'Y'" 281 423 unknown 2026-04-24
6 formula X (genitive ETYMON) + Y (nominative DERIVED NOUN) Translate as "Y is from 'X'" 265 374 unknown 2026-04-24
7 formula X (nominative DERIVED NOUN) + Y (nominative ETYMON) Translate as "'X' is from Y" 256 328 unknown 2026-04-24
8 formula ὡς + X (nominative ETYMON) + Y (nominative DERIVED NOUN) Translate as "(just) as 'Y' is from X" 214 331 unknown 2026-04-24
9 formula X (AUTHOR NAME) + Y (dative NUMBER) + Z (genitive BOOK NAME) Translate as "X, book Y in his *Z*" 203 257 unknown 2026-04-24
10 gloss ἐθνικόν τό ethnonym 203 218 estimated 2026-02-27
11 formula X (nominative PROPER NOUN) ... + ἀπό + Y (genitive ETYMON) Translate as "X... after Y" 197 283 unknown 2026-04-24
12 formula X (AUTHOR NAME) + ἐν + Y (dative BOOK NAME) Translate as "Χ, in his *Y*" 175 221 unknown 2026-04-24
13 formula X (nominative PROPER NOUN) + X (genitive PROPER NOUN) Translate as "X, X" 136 165 unknown 2026-04-24
14 formula ὁ πολίτης + X (nominative GENTILIC) Translate as "a citizen is an 'X'" 127 168 unknown 2026-04-24
15 formula ὡς + X (AUTHOR NAME) Translate as "as per X" 118 228 estimated 2026-02-26
16 formula X (AUTHOR NAME) + Y (NUMERAL) Translate as "X, book Y" 116 191 estimated 2026-03-02
17 formula Χ (AUTHOR NAME) + ἐν + Y (dative NUMBER) + Z (genitive BOOK NAME) Translate as "X, in book Y of his *Z*" 105 136 unknown 2026-04-24
18 gloss καλεῖται/ἐκαλεῖτο/κέκληται/ἐκλήθη (ἀπο...) is/used to be/is/was called/named after (+ ἀπο) 100 143 unknown 2026-04-24
19 formula X (nominative PROPER NOUN)... + ἀπό + Y (genitive ARTICLE + genitive ETYMON) Translate as "X... from the form 'Y'" 88 133 unknown 2026-04-24
20 formula ὡς + X (definite ARTICLE + ETYMON) + Y (nominative DERIVED NOUN) Translate as "just as 'Y' is from the name X" 88 119 unknown 2026-04-24
21 formula καί + X (nominative PROPER NOUN) + Y (nominative PROPER NOUN) Translate as "Y is also 'X'" 73 87 unknown 2026-04-24
22 gloss ἔθνος τό people 73 74 estimated 2026-02-27
23 formula X... + πρός + Y (dative) Translate as "X... near Y" 61 89 unknown 2026-04-24
24 formula X... πλησίον + Y. (genitive) Translate as "X... near Y" 44 66 unknown 2026-04-24
25 formula X (nominative DERIVED NOUN) + παρά + τό + Y (ETYMON) Translate as "X is deriving/derives from the form 'Y'" 41 52 actual 2026-05-13
26 gloss πολίτης ὁ citizen 41 41 estimated 2026-02-19
27 formula X (nominative AUTHOR NAME)... + ὡς αὐτός Translate as "X... as per X." 36 40 actual 2026-05-28
28 formula ὡς + X (AUTHOR NAME) + Y (dative NUMBER) + Z (genitive BOOK NAME) Translate as 'as per X, in book Y of his *Z*' 23 25 actual 2026-05-27
29 gloss ὄρος ὁ mountain, mount (X ὄρος = 'Mount X') 20 24 estimated 2026-02-19
30 gloss χωρίον τό locality; point (only in μέσα χωρία: ‘halfway point’) 17 25 estimated 2026-04-08
31 gloss γένος τό descent 17 17 estimated 2026-03-12
32 gloss κτητικόν τό possessive 15 16 estimated 2026-02-19
33 formula Y (EPITHET) + X (DEITY) Translate as "X Y" 14 22 estimated 2026-02-19
34 gloss οἰκήτωρ ὁ inhabitant, resident, patron (of a brothel...? - κ123) 14 14 estimated 2026-03-12
35 gloss γῆ ἡ land, dirt, Earth 13 19 estimated 2026-03-26
36 formula [NO ANTECEDENT] + ἀφ' οὗ + Y (DERIVED NOUN) Translate as "as per 'Y'" 12 25 estimated 2026-02-19
37 formula X (MASCULINE ETYMON) + ἀφ' οὗ + Y (DERIVED NOUN) Translate as "X... from/after whom is 'Y'" 12 23 estimated 2026-02-19
38 gloss ἄκρον τό cape (when on the coast, sgl.), headlands (when on the coast, plu.); peak (when inland) 12 22 estimated 2026-03-02
39 formula διὰ τοῦ + «X» (GREEK LETTER) + [FORM OF γράφειν] Translate as "writes/write/wrote/is written with X" 12 18 estimated 2026-02-15
40 gloss κώμη ἡ village 12 13 estimated 2026-05-12
41 gloss φυλή ἡ tribe 12 13 estimated 2026-03-02
42 gloss δῆμος ὁ deme 12 12 estimated 2026-03-02
43 gloss τοπικά τά locative term 12 12 estimated 2026-03-02
44 formula X (NEUTER ETYMON) + ἀφ' οὗ + Y (DERIVED NOUN) Translate as "X... from which is Y" or "X... as per 'Y'" 11 22 estimated 2026-02-19
45 gloss προσεχής -ες (+ dative) bordering, adjoining 11 12 unknown 2026-04-24
46 formula εἰς + «X» (GREEK LETTER) Translate as "ending in X" 10 14 estimated 2026-02-15
47 gloss δημότης ὁ deme-member 10 11 estimated 2026-03-02
48 gloss πόλισμα τό * town 9 12 estimated 2026-02-26
49 gloss ἄκρα ἡ promontory 9 11 estimated 2026-03-02
50 gloss ὁμωνύμως/ὁμώνυμος -ον homonymous, having the same name 9 11 estimated 2026-03-02