Pausanias Analysis

Words and Phrases that Predict Skeptical vs. Non-skeptical Content

Model Performance Metrics

The following metrics are from the logistic regression classifier's performance on the test set:

Class Precision Recall F1-Score Support
Non-skeptical 0.850 0.764 0.805 445
Skeptical 0.551 0.683 0.610 189
Overall Accuracy 0.740 634

Predictors of Skeptical Content

These words and phrases are most strongly associated with skeptical content in Pausanias.

Word/Phrase Coefficient Skeptical Count Non-skeptical Count p-value q-value
δοκεῖν 3.3083 78 17 4.9e-29 2.45e-26
εἶναι 3.1848 299 376 4.51e-25 1.13e-22
ἐμοὶ 2.9970 77 16 3.65e-29 2.45e-26
οἶδα 2.8773 69 31 4.31e-18 8.62e-16
ἤκουσα 2.7953 36 4 1.87e-16 2.35e-14
γε 2.7931 136 125 6.96e-18 1.16e-15
λέγουσιν 2.7483 141 176 3.07e-11 2.36e-09
ἐμοὶ δοκεῖν 2.5523 65 13 4.49e-25 1.13e-22
λόγῳ 2.4811 61 53 5.02e-09 2.95e-07
μοι 2.1265 105 101 4.93e-13 4.93e-11
εἴτε 2.1097 16 3 4.95e-07 1.9e-05
οὐκ 2.0901 243 331 1.88e-16 2.35e-14
εἰ 2.0397 93 78 7.52e-14 8.35e-12
ὅτι 2.0252 117 138 1.87e-10 1.24e-08
φασιν 2.0051 104 127 8.63e-09 4.54e-07
γενέσθαι 2.0014 123 188 3.22e-06 0.000104
τις 1.9468 64 62 3.27e-08 1.49e-06
λόγον 1.8304 54 30 2.13e-12 1.83e-10
ἔπη 1.7843 48 23 2.19e-12 1.83e-10
εἰ δὲ 1.7241 33 13 5.39e-10 3.37e-08
ἄν 1.6055 33 18 3.75e-08 1.63e-06
οὐδέν 1.4722 39 46 0.000364 0.00597
φασὶν 1.4627 72 84 6.92e-07 2.47e-05
ὡς 1.4579 283 629 0.0146 0.0994
ἐγὼ 1.4301 36 14 7.28e-11 5.2e-09
οὐχ 1.4073 67 85 1.33e-05 0.000334
δὴ 1.4055 280 578 0.000391 0.0063
μνῆμα 1.3839 50 70 0.00101 0.0131
λόγος 1.3324 57 54 1.19e-07 4.97e-06
οὐ 1.3306 278 539 9.12e-06 0.000253

Predictors of Non-skeptical Content

These words and phrases are most strongly associated with non-skeptical content in Pausanias.

Word/Phrase Coefficient Skeptical Count Non-skeptical Count p-value q-value
δύο -1.4158 27 120 0.00577 0.049
ἅτε -1.3304 27 122 0.0043 0.0413
καὶ -1.2963 861 2230 0.464 0.719
τότε δὲ -1.2497 5 58 6.02e-05 0.00134
τὸ δὲ -1.2313 77 246 0.078 0.277
σφισι -1.1751 34 125 0.0521 0.222
θαλάσσῃ -1.1691 10 65 0.00217 0.0236
ἐπὶ τὴν -1.1638 12 58 0.0325 0.169
κεῖται -1.1581 7 62 0.000247 0.00449
μετὰ -1.1357 69 259 0.0025 0.0268
τούτῳ -1.1087 44 129 0.444 0.698
ποσειδῶνος -1.0981 16 66 0.0739 0.273
λακεδαιμονίων -1.0721 29 143 0.000471 0.00733
ἐκ τῶν -1.0453 13 63 0.0252 0.147
αὖθις -1.0393 36 142 0.0148 0.0997
τὴν μὲν -1.0390 6 36 0.0339 0.173
ὅπλα -1.0360 3 32 0.00753 0.0598
μάλιστα -1.0165 95 302 0.053 0.222
χώραν -1.0030 20 82 0.0482 0.22
τῶν -0.9962 431 1165 0.233 0.513
καὶ ἄγαλμα -0.9962 12 70 0.00375 0.0375
ἀνέθηκεν -0.9471 4 30 0.0343 0.173
ἐπὶ θαλάσσῃ -0.9196 3 39 0.00147 0.0174
μεσσηνίους -0.9186 11 48 0.0942 0.314
πρὸς -0.8981 163 500 0.0284 0.157
βασιλεὺς -0.8955 7 54 0.00162 0.0188
τοῖς δὲ -0.8898 7 54 0.00162 0.0188
ὑπὲρ -0.8742 50 173 0.0514 0.221
τῆς -0.8686 405 1152 0.0168 0.108
ἐνταῦθα -0.8630 120 346 0.249 0.532