Pausanias Analysis

Greta sentence-level mythic vs. historical analysis

Surface, including books 4 and 8, with rhetoric markers

This model drops sentences tagged other and fits a balanced TF-IDF logistic regression to the remaining mythic and historical tags.

Model Performance Metrics

The following metrics are from the logistic regression classifier's performance on the test set:

Class Precision Recall F1-Score Support
Historical 0.610 0.669 0.638 248
Mythic 0.749 0.698 0.723 351
Overall Accuracy 0.686 599

Confusion Matrix

Predicted
Historical Mythic
Actual Historical 166 82
Mythic 106 245

Counts

2,393mythic/historical sentences
1,404mythic
989historical
1,000features

Predictors of Mythic Sentences

Sort by:
Word/Phrase English Coefficient Mythic Count Historical Count p-value q-value
εἶναι 2.3079 168 36 4.29e-14 4.29e-11
φασιν 1.7165 70 9 2.53e-09 8.43e-07
λέγουσιν 1.5723 121 24 3.85e-11 1.93e-08
ἐστιν 1.5705 122 39 2.52e-06 0.000315
φασὶν 1.5133 43 5 1.62e-06 0.00027
λέγουσι 1.3411 55 10 4.55e-06 0.000455
ἐστὶν 1.3015 55 20 0.00718 0.0876
πρῶτον 1.2349 44 11 0.000673 0.0204
δʼ 1.2317 32 4 0.00011 0.00551
αὐτοῦ 1.2258 45 16 0.0127 0.115
παῖδασ 1.2226 44 9 0.000118 0.0056
βωμὸσ 1.2179 17 0 0.000196 0.00853
ἔπη 1.1653 16 0 0.000374 0.0129
διὰ 1.1627 48 19 0.0254 0.161
ἱδρύσατο 1.1518 10 0 0.00694 0.0857
ὅτι 1.1294 51 16 0.00236 0.0429
ἔστι 1.0696 48 15 0.00309 0.0542
αὐτήν 1.0636 10 2 0.139 0.395
θυγάτηρ 1.0162 7 2 0.322 0.61
αὐτῶι 1.0105 65 22 0.00144 0.0327
ὄντα 0.9783 29 9 0.0212 0.157
λέγεται 0.9651 39 13 0.0127 0.115
λόγον 0.9611 16 3 0.0325 0.19
ὄνομα 0.9496 53 14 0.000327 0.0126
ἀρχαῖον 0.9395 18 3 0.0128 0.115
καὶ ὅτι 0.9352 13 0 0.00115 0.0278
λέγοντεσ 0.9172 17 1 0.00121 0.0282
αὐτῆι 0.9077 21 5 0.0157 0.133
φασι 0.8990 21 4 0.0126 0.115
καὶ εἶναι 0.8918 12 0 0.00207 0.039

Predictors of Historical Sentences

Sort by:
Word/Phrase English Coefficient Mythic Count Historical Count p-value q-value
τούτουσ -1.5637 6 13 0.0168 0.136
πόλεμον -1.5465 6 28 7.3e-07 0.000146
πολέμωι -1.5365 4 22 7.39e-06 0.000672
πολέμου -1.5144 7 24 3.78e-05 0.0023
ἄνδρασ -1.4347 5 16 0.0011 0.0275
οἰκίασ -1.3542 2 16 3.9e-05 0.0023
καὶ ἐσ -1.2835 38 55 0.000426 0.014
ἐναντία -1.2082 0 13 9.8e-06 0.000816
στρατιᾶι -1.2080 1 14 3.72e-05 0.0023
ναυσὶν -1.1911 3 22 3.13e-06 0.000348
μὴ -1.1389 19 25 0.037 0.207
βασιλέων -1.1204 0 12 2.39e-05 0.00171
πολλά -1.1150 0 6 0.00494 0.0726
ὅσον -1.1001 6 15 0.00508 0.0726
αὐτὸσ -1.0561 19 34 0.00072 0.0212
κόσμον -1.0367 0 7 0.00203 0.039
σφῶν -1.0357 0 9 0.000344 0.0128
ποτε -1.0251 12 14 0.197 0.472
ἤδη -1.0224 23 31 0.0163 0.135
χαλκοῦν -1.0075 2 9 0.0103 0.113
καὶ ἔτι -0.9827 0 8 0.000837 0.0226
ὃσ -0.9707 23 39 0.000534 0.0167
οὐ -0.9587 86 76 0.137 0.395
οὖν -0.9345 34 40 0.0252 0.161
πρώτοισ -0.9340 3 6 0.175 0.423
πλέον -0.9255 5 14 0.0041 0.0684
ὅσοι -0.9242 8 16 0.0119 0.115
κοινῶι -0.9161 2 7 0.0383 0.207
ἐρείπια -0.9035 1 5 0.0878 0.302
ἄλλα -0.8884 22 32 0.00737 0.0888