Sentence-level words and phrases that predict mythic vs. historical content
The following metrics are from the logistic regression classifier's performance on the test set:
| Class | Precision | Recall | F1-Score | Support |
|---|---|---|---|---|
| Historical | 0.756 | 0.719 | 0.737 | 499 |
| Mythic | 0.574 | 0.620 | 0.596 | 305 |
| Overall Accuracy | 0.682 | 804 | ||
| Word/Phrase | English | Coefficient | Mythic Count | Non-mythic Count | p-value | q-value |
|---|---|---|---|---|---|---|
| φασιν | they say | 2.8646 | 69 | 15 | 2.16e-15 | 7.2e-13 |
| λέγουσιν | they say | 2.3508 | 117 | 41 | 2.3e-18 | 1.15e-15 |
| λέγουσι | they say | 2.3437 | 61 | 18 | 1.8e-11 | 4.51e-09 |
| εἶναι | to be | 2.1190 | 199 | 89 | 2.08e-24 | 2.08e-21 |
| ὑπὸ | under; beneath; by (of agent) | 2.0119 | 184 | 134 | 7.11e-11 | 1.18e-08 |
| φασὶν | they say | 1.9424 | 39 | 12 | 1.39e-07 | 1.07e-05 |
| μνῆμα | memorial, monument (especially a tomb) | 1.9027 | 39 | 17 | 8.84e-06 | 0.000402 |
| λέγεται | it is said | 1.7737 | 49 | 21 | 4.67e-07 | 3.34e-05 |
| παῖδεσ | children | 1.6908 | 27 | 12 | 0.000265 | 0.00647 |
| ὄνομα | name | 1.6891 | 82 | 34 | 2.36e-11 | 4.71e-09 |
| τάφοσ | tomb | 1.6758 | 24 | 14 | 0.00463 | 0.0445 |
| μνῆμά | memorial, monument, tomb | 1.6355 | 17 | 7 | 0.00254 | 0.0296 |
| αὐτὸν | him | 1.6260 | 90 | 46 | 7.56e-10 | 9.46e-08 |
| γενέσθαι | to become | 1.6014 | 84 | 38 | 1.25e-10 | 1.79e-08 |
| φασι | they say | 1.5523 | 24 | 5 | 2.56e-06 | 0.000135 |
| παῖδασ | children | 1.5411 | 31 | 9 | 1.6e-06 | 0.0001 |
| λέγοντεσ | speaking | 1.4883 | 14 | 1 | 2.85e-05 | 0.00102 |
| θυγάτηρ | daughter | 1.4852 | 14 | 1 | 2.85e-05 | 0.00102 |
| ὄντα | being; things that are (beings) | 1.4454 | 25 | 11 | 0.000417 | 0.00875 |
| πρῶτον | first; first of all | 1.4381 | 44 | 15 | 8.36e-08 | 7.6e-06 |
| ἀρχήν | at first | 1.4377 | 17 | 6 | 0.0011 | 0.0176 |
| ἱδρύσατο | he/she founded | 1.3827 | 8 | 0 | 0.00071 | 0.0125 |
| λόγον | word | 1.3563 | 22 | 5 | 1.14e-05 | 0.000449 |
| θυγατέρα | daughter | 1.3504 | 32 | 6 | 2.23e-08 | 2.23e-06 |
| αὐτὴν | her | 1.3362 | 50 | 35 | 0.000569 | 0.0109 |
| ἔχειν | to have | 1.3241 | 18 | 6 | 0.000567 | 0.0109 |
| ὅτι | that; because | 1.3173 | 52 | 42 | 0.00331 | 0.0376 |
| ἀποθανεῖν | to die | 1.2864 | 30 | 6 | 1.02e-07 | 8.52e-06 |
| πατρὸσ | of the father; father’s | 1.2604 | 14 | 3 | 0.000651 | 0.0121 |
| γένοσ | race; kind | 1.2553 | 13 | 8 | 0.0473 | 0.21 |
| Word/Phrase | English | Coefficient | Mythic Count | Non-mythic Count | p-value | q-value |
|---|---|---|---|---|---|---|
| χαλκοῦν | of bronze; brazen | -2.0214 | 2 | 36 | 1.12e-06 | 7.46e-05 |
| σφισιν | to them | -1.9376 | 26 | 87 | 7.13e-05 | 0.00198 |
| μάλιστα | especially; very much; certainly | -1.6439 | 36 | 110 | 4.41e-05 | 0.00145 |
| μὴ | not | -1.5268 | 16 | 37 | 0.119 | 0.363 |
| αὐτῶν | of them; their | -1.4077 | 31 | 82 | 0.0034 | 0.0382 |
| λίθου | of a stone | -1.3315 | 11 | 52 | 7.45e-05 | 0.00201 |
| ἀγάλματα | statues | -1.3264 | 9 | 53 | 7.16e-06 | 0.000341 |
| ναυσὶν | to the ships | -1.2777 | 5 | 28 | 0.0015 | 0.0214 |
| θέασ | of a view | -1.2624 | 4 | 39 | 1.17e-05 | 0.000449 |
| ἔξω | outside | -1.2490 | 1 | 10 | 0.0345 | 0.168 |
| ἄλλωσ | otherwise | -1.1956 | 7 | 25 | 0.0252 | 0.137 |
| ἔργον | work; deed | -1.1940 | 12 | 44 | 0.00233 | 0.0279 |
| οἵ | who | -1.1867 | 1 | 12 | 0.02 | 0.117 |
| καὶ ἐσ | and into | -1.1709 | 43 | 102 | 0.00599 | 0.0544 |
| σταδίουσ | stades | -1.1667 | 1 | 17 | 0.00139 | 0.0208 |
| στάδια | stades | -1.1430 | 0 | 19 | 6.43e-05 | 0.00189 |
| χαλκοῦσ | of bronze | -1.1185 | 4 | 22 | 0.00848 | 0.0717 |
| ἦσαν | they were | -1.1098 | 10 | 38 | 0.00362 | 0.0402 |
| ἐστιν | is | -1.0388 | 127 | 288 | 1.17e-05 | 0.000449 |
| που | somewhere; perhaps; I suppose | -1.0284 | 3 | 14 | 0.0802 | 0.271 |
| χρυσοῦ | of gold | -1.0191 | 1 | 16 | 0.00234 | 0.0279 |
| ἔτει | in the year | -1.0183 | 0 | 12 | 0.00232 | 0.0279 |
| τρόπον | manner | -1.0160 | 3 | 17 | 0.0214 | 0.124 |
| τέχνη | art, craft, skill | -0.9994 | 1 | 18 | 0.00144 | 0.0208 |
| ἐστί | is | -0.9986 | 2 | 10 | 0.139 | 0.383 |
| ἐναντία | opposite; contrary | -0.9869 | 3 | 16 | 0.0334 | 0.168 |
| πολὺ | much; very; by far | -0.9829 | 10 | 35 | 0.00919 | 0.0718 |
| ὅσα | all the things that | -0.9660 | 8 | 23 | 0.0861 | 0.284 |
| ἔργου | of work | -0.9642 | 2 | 12 | 0.0557 | 0.229 |
| ἔχοντεσ | having | -0.9637 | 3 | 15 | 0.0519 | 0.219 |