Pausanias Analysis

Sentence-level words and phrases that predict skepticism vs. non-skepticism

Model Performance Metrics

The following metrics are from the logistic regression classifier's performance on the test set:

Class Precision Recall F1-Score Support
Non-skeptical 0.933 0.813 0.869 358
Skeptical 0.309 0.588 0.405 51
Overall Accuracy 0.785 409

Sentence Predictors of Skeptical Content

Word/Phrase Coefficient Skeptical Count Non-skeptical Count p-value q-value
λέγεται 5.0493 36 9 2.22e-23 2.22e-20
λέγουσιν 3.6427 40 37 3.82e-15 1.91e-12
φασιν 3.3174 20 12 4.18e-10 1.05e-07
λέγουσι 2.9490 27 18 1.34e-12 4.47e-10
φασὶν 2.9417 15 6 3.65e-09 7.3e-07
φασὶ 2.6642 12 2 5.94e-09 9.9e-07
εἶναι 2.4526 44 87 1.86e-08 2.65e-06
εἴτε 2.4226 7 1 9.92e-06 0.0011
ἐγὼ 1.9903 9 7 0.000111 0.0079
εἰ 1.9145 11 23 0.00912 0.207
ποιῆσαι 1.9089 7 4 0.000273 0.0152
οὐδένα 1.8459 6 7 0.00677 0.174
ὡς 1.8144 52 157 4.37e-05 0.00397
γενέσθαι 1.7717 24 37 1.74e-06 0.000217
ἄνδρα 1.7716 8 11 0.00377 0.13
φασι 1.6321 5 5 0.00847 0.197
τὴν 1.5932 99 400 0.000316 0.0166
ταύτην 1.5549 12 21 0.00195 0.078
ὀστᾶ 1.5044 5 4 0.00509 0.146
δοκεῖν 1.4617 9 8 0.000217 0.0128
δοκῶ 1.4613 7 2 3.9e-05 0.0039
νομίζουσι 1.4571 4 3 0.0112 0.229
ἔχει 1.4230 12 35 0.0506 0.562
σαφὲς 1.4079 5 2 0.0011 0.0522
ἀποθανεῖν 1.3882 11 15 0.000628 0.0314
νίσου 1.3834 4 2 0.00543 0.151
ὁμήρου 1.3812 6 5 0.00206 0.0794
οὐδέν 1.2654 7 11 0.0118 0.237
ὅστις 1.2063 4 12 0.277 0.984
ἐμοὶ δοκεῖν 1.1985 7 7 0.00181 0.0755

Sentence Predictors of Non-skeptical Content

Word/Phrase Coefficient Skeptical Count Non-skeptical Count p-value q-value
καὶ -1.5854 175 1140 0.0155 0.293
σφισιν -1.2774 3 56 0.0379 0.541
ὑπὲρ -1.1886 4 41 0.392 1
ἔτι -1.1052 7 82 0.0437 0.546
τούτοις -0.9906 1 22 0.236 0.969
ἱερὸν -0.9551 1 35 0.0527 0.572
δὲ καὶ -0.9315 19 161 0.0848 0.742
ὃν -0.8930 3 30 0.464 1
δὲ τῆς -0.8411 1 20 0.348 0.99
ἐστιν -0.8307 16 149 0.0453 0.546
ἦν -0.8228 13 89 0.554 1
μακεδόνων -0.8116 0 25 0.0405 0.546
πλησίον -0.7932 4 46 0.225 0.969
μετὰ -0.7627 4 61 0.0493 0.562
εἰσὶ -0.7479 1 20 0.348 0.99
πύρρος -0.7454 0 18 0.0941 0.759
ἀθηναίοις -0.7319 9 68 0.429 1
δὲ τοῦ -0.7284 3 38 0.262 0.984
τὸν παῖδα -0.7223 1 9 1 1
πολέμῳ -0.7212 0 14 0.245 0.977
λέσχεως -0.7207 0 11 0.385 0.99
διʼ -0.7185 1 21 0.235 0.969
παῖδες -0.6995 0 20 0.0601 0.629
διὸς -0.6813 1 25 0.162 0.956
ἀθηναῖοι -0.6799 3 41 0.194 0.969
ἐναντία -0.6723 0 14 0.245 0.977
ὕδωρ -0.6720 3 16 0.752 1
τότε -0.6670 3 57 0.0262 0.43
ἕλληνες -0.6648 2 17 1 1
γυναῖκες -0.6510 0 13 0.237 0.969