Pausanias Analysis

Sentence-level words and phrases that predict mythic vs. historical content

Model Performance Metrics

The following metrics are from the logistic regression classifier's performance on the test set:

Class Precision Recall F1-Score Support
Historical 0.756 0.719 0.737 499
Mythic 0.574 0.620 0.596 305
Overall Accuracy 0.682 804

Sentence Predictors of Mythic Content

Word/Phrase English Coefficient Mythic Count Non-mythic Count p-value q-value
φασιν they say 2.8646 69 15 2.16e-15 7.2e-13
λέγουσιν they say 2.3508 117 41 2.3e-18 1.15e-15
λέγουσι they say 2.3437 61 18 1.8e-11 4.51e-09
εἶναι to be 2.1190 199 89 2.08e-24 2.08e-21
ὑπὸ under; beneath; by (of agent) 2.0119 184 134 7.11e-11 1.18e-08
φασὶν they say 1.9424 39 12 1.39e-07 1.07e-05
μνῆμα memorial, monument (especially a tomb) 1.9027 39 17 8.84e-06 0.000402
λέγεται it is said 1.7737 49 21 4.67e-07 3.34e-05
παῖδεσ children 1.6908 27 12 0.000265 0.00647
ὄνομα name 1.6891 82 34 2.36e-11 4.71e-09
τάφοσ tomb 1.6758 24 14 0.00463 0.0445
μνῆμά memorial, monument, tomb 1.6355 17 7 0.00254 0.0296
αὐτὸν him 1.6260 90 46 7.56e-10 9.46e-08
γενέσθαι to become 1.6014 84 38 1.25e-10 1.79e-08
φασι they say 1.5523 24 5 2.56e-06 0.000135
παῖδασ children 1.5411 31 9 1.6e-06 0.0001
λέγοντεσ speaking 1.4883 14 1 2.85e-05 0.00102
θυγάτηρ daughter 1.4852 14 1 2.85e-05 0.00102
ὄντα being; things that are (beings) 1.4454 25 11 0.000417 0.00875
πρῶτον first; first of all 1.4381 44 15 8.36e-08 7.6e-06
ἀρχήν at first 1.4377 17 6 0.0011 0.0176
ἱδρύσατο he/she founded 1.3827 8 0 0.00071 0.0125
λόγον word 1.3563 22 5 1.14e-05 0.000449
θυγατέρα daughter 1.3504 32 6 2.23e-08 2.23e-06
αὐτὴν her 1.3362 50 35 0.000569 0.0109
ἔχειν to have 1.3241 18 6 0.000567 0.0109
ὅτι that; because 1.3173 52 42 0.00331 0.0376
ἀποθανεῖν to die 1.2864 30 6 1.02e-07 8.52e-06
πατρὸσ of the father; father’s 1.2604 14 3 0.000651 0.0121
γένοσ race; kind 1.2553 13 8 0.0473 0.21

Sentence Predictors of Historical Content

Word/Phrase English Coefficient Mythic Count Non-mythic Count p-value q-value
χαλκοῦν of bronze; brazen -2.0214 2 36 1.12e-06 7.46e-05
σφισιν to them -1.9376 26 87 7.13e-05 0.00198
μάλιστα especially; very much; certainly -1.6439 36 110 4.41e-05 0.00145
μὴ not -1.5268 16 37 0.119 0.363
αὐτῶν of them; their -1.4077 31 82 0.0034 0.0382
λίθου of a stone -1.3315 11 52 7.45e-05 0.00201
ἀγάλματα statues -1.3264 9 53 7.16e-06 0.000341
ναυσὶν to the ships -1.2777 5 28 0.0015 0.0214
θέασ of a view -1.2624 4 39 1.17e-05 0.000449
ἔξω outside -1.2490 1 10 0.0345 0.168
ἄλλωσ otherwise -1.1956 7 25 0.0252 0.137
ἔργον work; deed -1.1940 12 44 0.00233 0.0279
οἵ who -1.1867 1 12 0.02 0.117
καὶ ἐσ and into -1.1709 43 102 0.00599 0.0544
σταδίουσ stades -1.1667 1 17 0.00139 0.0208
στάδια stades -1.1430 0 19 6.43e-05 0.00189
χαλκοῦσ of bronze -1.1185 4 22 0.00848 0.0717
ἦσαν they were -1.1098 10 38 0.00362 0.0402
ἐστιν is -1.0388 127 288 1.17e-05 0.000449
που somewhere; perhaps; I suppose -1.0284 3 14 0.0802 0.271
χρυσοῦ of gold -1.0191 1 16 0.00234 0.0279
ἔτει in the year -1.0183 0 12 0.00232 0.0279
τρόπον manner -1.0160 3 17 0.0214 0.124
τέχνη art, craft, skill -0.9994 1 18 0.00144 0.0208
ἐστί is -0.9986 2 10 0.139 0.383
ἐναντία opposite; contrary -0.9869 3 16 0.0334 0.168
πολὺ much; very; by far -0.9829 10 35 0.00919 0.0718
ὅσα all the things that -0.9660 8 23 0.0861 0.284
ἔργου of work -0.9642 2 12 0.0557 0.229
ἔχοντεσ having -0.9637 3 15 0.0519 0.219