Sentence-level words and phrases that predict skepticism vs. non-skepticism
The following metrics are from the logistic regression classifier's performance on the test set:
| Class | Precision | Recall | F1-Score | Support |
|---|---|---|---|---|
| Non-skeptical | 0.933 | 0.813 | 0.869 | 358 |
| Skeptical | 0.309 | 0.588 | 0.405 | 51 |
| Overall Accuracy | 0.785 | 409 | ||
| Word/Phrase | Coefficient | Skeptical Count | Non-skeptical Count | p-value | q-value |
|---|---|---|---|---|---|
| λέγεται | 5.0493 | 36 | 9 | 2.22e-23 | 2.22e-20 |
| λέγουσιν | 3.6427 | 40 | 37 | 3.82e-15 | 1.91e-12 |
| φασιν | 3.3174 | 20 | 12 | 4.18e-10 | 1.05e-07 |
| λέγουσι | 2.9490 | 27 | 18 | 1.34e-12 | 4.47e-10 |
| φασὶν | 2.9417 | 15 | 6 | 3.65e-09 | 7.3e-07 |
| φασὶ | 2.6642 | 12 | 2 | 5.94e-09 | 9.9e-07 |
| εἶναι | 2.4526 | 44 | 87 | 1.86e-08 | 2.65e-06 |
| εἴτε | 2.4226 | 7 | 1 | 9.92e-06 | 0.0011 |
| ἐγὼ | 1.9903 | 9 | 7 | 0.000111 | 0.0079 |
| εἰ | 1.9145 | 11 | 23 | 0.00912 | 0.207 |
| ποιῆσαι | 1.9089 | 7 | 4 | 0.000273 | 0.0152 |
| οὐδένα | 1.8459 | 6 | 7 | 0.00677 | 0.174 |
| ὡς | 1.8144 | 52 | 157 | 4.37e-05 | 0.00397 |
| γενέσθαι | 1.7717 | 24 | 37 | 1.74e-06 | 0.000217 |
| ἄνδρα | 1.7716 | 8 | 11 | 0.00377 | 0.13 |
| φασι | 1.6321 | 5 | 5 | 0.00847 | 0.197 |
| τὴν | 1.5932 | 99 | 400 | 0.000316 | 0.0166 |
| ταύτην | 1.5549 | 12 | 21 | 0.00195 | 0.078 |
| ὀστᾶ | 1.5044 | 5 | 4 | 0.00509 | 0.146 |
| δοκεῖν | 1.4617 | 9 | 8 | 0.000217 | 0.0128 |
| δοκῶ | 1.4613 | 7 | 2 | 3.9e-05 | 0.0039 |
| νομίζουσι | 1.4571 | 4 | 3 | 0.0112 | 0.229 |
| ἔχει | 1.4230 | 12 | 35 | 0.0506 | 0.562 |
| σαφὲς | 1.4079 | 5 | 2 | 0.0011 | 0.0522 |
| ἀποθανεῖν | 1.3882 | 11 | 15 | 0.000628 | 0.0314 |
| νίσου | 1.3834 | 4 | 2 | 0.00543 | 0.151 |
| ὁμήρου | 1.3812 | 6 | 5 | 0.00206 | 0.0794 |
| οὐδέν | 1.2654 | 7 | 11 | 0.0118 | 0.237 |
| ὅστις | 1.2063 | 4 | 12 | 0.277 | 0.984 |
| ἐμοὶ δοκεῖν | 1.1985 | 7 | 7 | 0.00181 | 0.0755 |
| Word/Phrase | Coefficient | Skeptical Count | Non-skeptical Count | p-value | q-value |
|---|---|---|---|---|---|
| καὶ | -1.5854 | 175 | 1140 | 0.0155 | 0.293 |
| σφισιν | -1.2774 | 3 | 56 | 0.0379 | 0.541 |
| ὑπὲρ | -1.1886 | 4 | 41 | 0.392 | 1 |
| ἔτι | -1.1052 | 7 | 82 | 0.0437 | 0.546 |
| τούτοις | -0.9906 | 1 | 22 | 0.236 | 0.969 |
| ἱερὸν | -0.9551 | 1 | 35 | 0.0527 | 0.572 |
| δὲ καὶ | -0.9315 | 19 | 161 | 0.0848 | 0.742 |
| ὃν | -0.8930 | 3 | 30 | 0.464 | 1 |
| δὲ τῆς | -0.8411 | 1 | 20 | 0.348 | 0.99 |
| ἐστιν | -0.8307 | 16 | 149 | 0.0453 | 0.546 |
| ἦν | -0.8228 | 13 | 89 | 0.554 | 1 |
| μακεδόνων | -0.8116 | 0 | 25 | 0.0405 | 0.546 |
| πλησίον | -0.7932 | 4 | 46 | 0.225 | 0.969 |
| μετὰ | -0.7627 | 4 | 61 | 0.0493 | 0.562 |
| εἰσὶ | -0.7479 | 1 | 20 | 0.348 | 0.99 |
| πύρρος | -0.7454 | 0 | 18 | 0.0941 | 0.759 |
| ἀθηναίοις | -0.7319 | 9 | 68 | 0.429 | 1 |
| δὲ τοῦ | -0.7284 | 3 | 38 | 0.262 | 0.984 |
| τὸν παῖδα | -0.7223 | 1 | 9 | 1 | 1 |
| πολέμῳ | -0.7212 | 0 | 14 | 0.245 | 0.977 |
| λέσχεως | -0.7207 | 0 | 11 | 0.385 | 0.99 |
| διʼ | -0.7185 | 1 | 21 | 0.235 | 0.969 |
| παῖδες | -0.6995 | 0 | 20 | 0.0601 | 0.629 |
| διὸς | -0.6813 | 1 | 25 | 0.162 | 0.956 |
| ἀθηναῖοι | -0.6799 | 3 | 41 | 0.194 | 0.969 |
| ἐναντία | -0.6723 | 0 | 14 | 0.245 | 0.977 |
| ὕδωρ | -0.6720 | 3 | 16 | 0.752 | 1 |
| τότε | -0.6670 | 3 | 57 | 0.0262 | 0.43 |
| ἕλληνες | -0.6648 | 2 | 17 | 1 | 1 |
| γυναῖκες | -0.6510 | 0 | 13 | 0.237 | 0.969 |