Words and Phrases that Predict Skeptical vs. Non-skeptical Content
The logistic regression classifier achieved the following metrics on the test set:
| Class | Precision | Recall | F1-Score | Support |
|---|---|---|---|---|
| Non-skeptical | 0.850 | 0.764 | 0.805 | 445 |
| Skeptical | 0.551 | 0.683 | 0.610 | 189 |
| Overall accuracy | | | 0.740 | 634 |
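The F1 and accuracy figures in the table can be cross-checked from the precision, recall, and support columns. The sketch below is only an illustration: the function names are hypothetical, and it assumes F1 is the harmonic mean of precision and recall and that accuracy equals the support-weighted mean of per-class recall.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


def accuracy_from_recalls(recalls_and_supports):
    """Accuracy as the support-weighted mean of per-class recall."""
    total = sum(support for _, support in recalls_and_supports)
    correct = sum(recall * support for recall, support in recalls_and_supports)
    return correct / total


# Values copied from the metrics table (rounded to three decimals there).
f1_non_skeptical = f1_score(precision=0.850, recall=0.764)
f1_skeptical = f1_score(precision=0.551, recall=0.683)
accuracy = accuracy_from_recalls([(0.764, 445), (0.683, 189)])

print(round(f1_non_skeptical, 3), round(f1_skeptical, 3), round(accuracy, 3))
```

Because the inputs are already rounded, the recomputed values agree with the table only to about three decimal places.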
The words and phrases below are most strongly associated with skeptical content in Pausanias, listed by descending coefficient.
| Word/Phrase | Coefficient | Skeptical Count | Non-skeptical Count | p-value | q-value |
|---|---|---|---|---|---|
| δοκεῖν | 3.3083 | 78 | 17 | 4.9e-29 | 2.45e-26 |
| εἶναι | 3.1848 | 299 | 376 | 4.51e-25 | 1.13e-22 |
| ἐμοὶ | 2.9970 | 77 | 16 | 3.65e-29 | 2.45e-26 |
| οἶδα | 2.8773 | 69 | 31 | 4.31e-18 | 8.62e-16 |
| ἤκουσα | 2.7953 | 36 | 4 | 1.87e-16 | 2.35e-14 |
| γε | 2.7931 | 136 | 125 | 6.96e-18 | 1.16e-15 |
| λέγουσιν | 2.7483 | 141 | 176 | 3.07e-11 | 2.36e-09 |
| ἐμοὶ δοκεῖν | 2.5523 | 65 | 13 | 4.49e-25 | 1.13e-22 |
| λόγῳ | 2.4811 | 61 | 53 | 5.02e-09 | 2.95e-07 |
| μοι | 2.1265 | 105 | 101 | 4.93e-13 | 4.93e-11 |
| εἴτε | 2.1097 | 16 | 3 | 4.95e-07 | 1.9e-05 |
| οὐκ | 2.0901 | 243 | 331 | 1.88e-16 | 2.35e-14 |
| εἰ | 2.0397 | 93 | 78 | 7.52e-14 | 8.35e-12 |
| ὅτι | 2.0252 | 117 | 138 | 1.87e-10 | 1.24e-08 |
| φασιν | 2.0051 | 104 | 127 | 8.63e-09 | 4.54e-07 |
| γενέσθαι | 2.0014 | 123 | 188 | 3.22e-06 | 0.000104 |
| τις | 1.9468 | 64 | 62 | 3.27e-08 | 1.49e-06 |
| λόγον | 1.8304 | 54 | 30 | 2.13e-12 | 1.83e-10 |
| ἔπη | 1.7843 | 48 | 23 | 2.19e-12 | 1.83e-10 |
| εἰ δὲ | 1.7241 | 33 | 13 | 5.39e-10 | 3.37e-08 |
| ἄν | 1.6055 | 33 | 18 | 3.75e-08 | 1.63e-06 |
| οὐδέν | 1.4722 | 39 | 46 | 0.000364 | 0.00597 |
| φασὶν | 1.4627 | 72 | 84 | 6.92e-07 | 2.47e-05 |
| ὡς | 1.4579 | 283 | 629 | 0.0146 | 0.0994 |
| ἐγὼ | 1.4301 | 36 | 14 | 7.28e-11 | 5.2e-09 |
| οὐχ | 1.4073 | 67 | 85 | 1.33e-05 | 0.000334 |
| δὴ | 1.4055 | 280 | 578 | 0.000391 | 0.0063 |
| μνῆμα | 1.3839 | 50 | 70 | 0.00101 | 0.0131 |
| λόγος | 1.3324 | 57 | 54 | 1.19e-07 | 4.97e-06 |
| οὐ | 1.3306 | 278 | 539 | 9.12e-06 | 0.000253 |
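The source does not say how the q-values were derived. The smallest ones in the table are numerically consistent with a Benjamini-Hochberg adjustment over about 1,000 tested features, so the sketch below works under that assumption. The function name and the `n_tests=1000` value are illustrative guesses, not the author's documented method.

```python
def benjamini_hochberg(p_values, n_tests=None):
    """Return Benjamini-Hochberg adjusted q-values in the input order.

    n_tests is the total number of hypotheses tested. It defaults to
    len(p_values); pass a larger value only when p_values holds the
    smallest p-values of a longer list.
    """
    m = n_tests if n_tests is not None else len(p_values)
    order = sorted(range(len(p_values)), key=lambda i: p_values[i])
    q_values = [0.0] * len(p_values)
    running_min = 1.0
    # Walk from the largest listed p-value down, so each q-value is the
    # minimum of p * m / rank over its own rank and all higher listed ranks.
    for rank in range(len(order), 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        q_values[i] = running_min
    return q_values


# Five smallest p-values from the skeptical table, in table order.
terms = ["δοκεῖν", "εἶναι", "ἐμοὶ", "οἶδα", "ἐμοὶ δοκεῖν"]
p_values = [4.9e-29, 4.51e-25, 3.65e-29, 4.31e-18, 4.49e-25]

# ASSUMPTION: 1,000 features were tested in total (inferred, not stated).
q_values = benjamini_hochberg(p_values, n_tests=1000)
for term, q in zip(terms, q_values):
    print(f"{term}\t{q:.3g}")
```

Under this assumption the adjusted values for these five terms agree with the table's q-value column to the precision shown, which is what motivates the guess. A different correction or feature count would need a different check.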
The words and phrases below are most strongly associated with non-skeptical content in Pausanias, listed from the most negative coefficient.
| Word/Phrase | Coefficient | Skeptical Count | Non-skeptical Count | p-value | q-value |
|---|---|---|---|---|---|
| δύο | -1.4158 | 27 | 120 | 0.00577 | 0.049 |
| ἅτε | -1.3304 | 27 | 122 | 0.0043 | 0.0413 |
| καὶ | -1.2963 | 861 | 2230 | 0.464 | 0.719 |
| τότε δὲ | -1.2497 | 5 | 58 | 6.02e-05 | 0.00134 |
| τὸ δὲ | -1.2313 | 77 | 246 | 0.078 | 0.277 |
| σφισι | -1.1751 | 34 | 125 | 0.0521 | 0.222 |
| θαλάσσῃ | -1.1691 | 10 | 65 | 0.00217 | 0.0236 |
| ἐπὶ τὴν | -1.1638 | 12 | 58 | 0.0325 | 0.169 |
| κεῖται | -1.1581 | 7 | 62 | 0.000247 | 0.00449 |
| μετὰ | -1.1357 | 69 | 259 | 0.0025 | 0.0268 |
| τούτῳ | -1.1087 | 44 | 129 | 0.444 | 0.698 |
| ποσειδῶνος | -1.0981 | 16 | 66 | 0.0739 | 0.273 |
| λακεδαιμονίων | -1.0721 | 29 | 143 | 0.000471 | 0.00733 |
| ἐκ τῶν | -1.0453 | 13 | 63 | 0.0252 | 0.147 |
| αὖθις | -1.0393 | 36 | 142 | 0.0148 | 0.0997 |
| τὴν μὲν | -1.0390 | 6 | 36 | 0.0339 | 0.173 |
| ὅπλα | -1.0360 | 3 | 32 | 0.00753 | 0.0598 |
| μάλιστα | -1.0165 | 95 | 302 | 0.053 | 0.222 |
| χώραν | -1.0030 | 20 | 82 | 0.0482 | 0.22 |
| τῶν | -0.9962 | 431 | 1165 | 0.233 | 0.513 |
| καὶ ἄγαλμα | -0.9962 | 12 | 70 | 0.00375 | 0.0375 |
| ἀνέθηκεν | -0.9471 | 4 | 30 | 0.0343 | 0.173 |
| ἐπὶ θαλάσσῃ | -0.9196 | 3 | 39 | 0.00147 | 0.0174 |
| μεσσηνίους | -0.9186 | 11 | 48 | 0.0942 | 0.314 |
| πρὸς | -0.8981 | 163 | 500 | 0.0284 | 0.157 |
| βασιλεὺς | -0.8955 | 7 | 54 | 0.00162 | 0.0188 |
| τοῖς δὲ | -0.8898 | 7 | 54 | 0.00162 | 0.0188 |
| ὑπὲρ | -0.8742 | 50 | 173 | 0.0514 | 0.221 |
| τῆς | -0.8686 | 405 | 1152 | 0.0168 | 0.108 |
| ἐνταῦθα | -0.8630 | 120 | 346 | 0.249 | 0.532 |
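Several terms in this table have sizeable coefficients but q-values well above a conventional 0.05 cutoff (for example καὶ, with q = 0.719), so the coefficient alone should be read with caution. The sketch below shows one way to filter rows by q-value. The `Term` type, the `significant_terms` helper, and the 0.05 threshold are assumptions made for illustration, and only a handful of rows are copied from the table.

```python
from typing import NamedTuple


class Term(NamedTuple):
    text: str
    coefficient: float
    q_value: float


# A few rows copied from the non-skeptical table.
rows = [
    Term("δύο", -1.4158, 0.049),
    Term("καὶ", -1.2963, 0.719),
    Term("τότε δὲ", -1.2497, 0.00134),
    Term("τὸ δὲ", -1.2313, 0.277),
    Term("κεῖται", -1.1581, 0.00449),
]


def significant_terms(terms, alpha=0.05):
    """Keep terms whose q-value is below alpha (assumed cutoff)."""
    return [t for t in terms if t.q_value < alpha]


kept = significant_terms(rows)
for term in kept:
    print(f"{term.text}\t{term.coefficient}\t{term.q_value}")
```

With these sample rows, common function words such as καὶ and τὸ δὲ drop out, while δύο, τότε δὲ, and κεῖται remain.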