Surface, including books 4 and 8, with rhetoric markers
This model drops sentences tagged "other" and fits a balanced logistic regression on TF-IDF features to the remaining sentences, which are tagged either mythic or historical.
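A minimal sketch of such a pipeline, assuming scikit-learn. The function name `fit_surface_model`, the toy strings, and every hyperparameter are illustrative assumptions rather than the settings behind this report: the 1,000-feature cap and the bigram range are inferred from the counts and the two-word entries in the tables, and "balanced" is read here as inverse-frequency class weighting.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline


def fit_surface_model(sentences, tags):
    """Drop sentences tagged "other", then fit TF-IDF + logistic regression."""
    kept = [(s, t) for s, t in zip(sentences, tags) if t != "other"]
    texts = [s for s, _ in kept]
    labels = [t for _, t in kept]
    model = make_pipeline(
        # Assumed settings: unigrams and bigrams, capped at 1,000 features,
        # whitespace-delimited tokens so short forms such as "δʼ" are kept.
        TfidfVectorizer(
            ngram_range=(1, 2),
            max_features=1000,
            token_pattern=r"\S+",
            lowercase=False,
        ),
        # Assumed reading of "balanced": weight classes by inverse frequency.
        LogisticRegression(class_weight="balanced", max_iter=1000),
    )
    model.fit(texts, labels)
    return model


# Toy strings assembled from tokens in the tables; these are not real sentences.
toy_sentences = [
    "φασιν εἶναι βωμὸσ",
    "λέγουσιν ἔπη εἶναι",
    "πολέμωι ναυσὶν στρατιᾶι",
    "πόλεμον βασιλέων ναυσὶν",
    "ἐρείπια",
]
toy_tags = ["mythic", "mythic", "historical", "historical", "other"]

model = fit_surface_model(toy_sentences, toy_tags)
classifier = model[-1]
# scikit-learn sorts the labels, so classes_ is ["historical", "mythic"] and a
# positive coefficient pushes a sentence toward "mythic".
print(list(classifier.classes_))
print(classifier.coef_.shape)
```

With these label strings, a positive coefficient favors the mythic class and a negative one favors the historical class, which is the sign convention the predictor tables follow.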
Model Performance Metrics
The following metrics are from the logistic regression classifier's performance on the test set:
| Class | Precision | Recall | F1-Score | Support |
|---|---|---|---|---|
| Historical | 0.610 | 0.669 | 0.638 | 248 |
| Mythic | 0.749 | 0.698 | 0.723 | 351 |
| Overall Accuracy | | | 0.686 | 599 |
Confusion Matrix
| | Predicted: Historical | Predicted: Mythic |
|---|---|---|
| Actual: Historical | 166 | 82 |
| Actual: Mythic | 106 | 245 |
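The per-class metrics and the overall accuracy follow directly from the four confusion-matrix cells. The short sketch below recomputes them in plain Python; the helper names `class_metrics` and `accuracy` are illustrative, not part of the original analysis.

```python
# Confusion matrix from the report: outer keys are actual classes,
# inner keys are predicted classes.
confusion = {
    "historical": {"historical": 166, "mythic": 82},
    "mythic": {"historical": 106, "mythic": 245},
}


def class_metrics(confusion, label):
    """Return (precision, recall, f1, support) for one class."""
    true_positive = confusion[label][label]
    predicted = sum(row[label] for row in confusion.values())  # column total
    actual = sum(confusion[label].values())  # row total, i.e. the support
    precision = true_positive / predicted
    recall = true_positive / actual
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1, actual


def accuracy(confusion):
    """Share of test sentences on the diagonal of the matrix."""
    correct = sum(confusion[label][label] for label in confusion)
    total = sum(sum(row.values()) for row in confusion.values())
    return correct / total


for label in confusion:
    p, r, f1, support = class_metrics(confusion, label)
    print(f"{label:<10} precision={p:.3f} recall={r:.3f} f1={f1:.3f} support={support}")
print(f"accuracy={accuracy(confusion):.3f}")
```

For example, historical precision is 166 / (166 + 106) and historical recall is 166 / (166 + 82); rounded to three decimals, the results reproduce the figures reported under Model Performance Metrics.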
Counts
2,393 mythic/historical sentences
1,404 mythic
989 historical
1,000 features
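The two classes are unequal in size, which is what a "balanced" fit compensates for. As a rough illustration, assuming "balanced" means the common inverse-frequency scheme (weight = total / (number of classes × class count), as in scikit-learn's `class_weight="balanced"`), the weights implied by the full counts can be computed as below. In practice such weights would be derived from the training split only, so the exact values used by the model may differ; `balanced_weights` is a hypothetical helper name.

```python
# Sentence counts from the report (full mythic/historical set).
counts = {"mythic": 1404, "historical": 989}


def balanced_weights(counts):
    """Inverse-frequency class weights: total / (n_classes * class_count)."""
    total = sum(counts.values())
    n_classes = len(counts)
    return {label: total / (n_classes * n) for label, n in counts.items()}


weights = balanced_weights(counts)
for label, weight in weights.items():
    print(f"{label}: {weight:.4f}")
```

Under this assumption the smaller historical class receives a weight above 1 and the larger mythic class a weight below 1, so errors on historical sentences cost more during fitting.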
Predictors of Mythic Sentences
| Word/Phrase | English | Coefficient | Mythic Count | Historical Count | p-value | q-value |
|---|---|---|---|---|---|---|
| εἶναι | | 2.3079 | 168 | 36 | 4.29e-14 | 4.29e-11 |
| φασιν | | 1.7165 | 70 | 9 | 2.53e-09 | 8.43e-07 |
| λέγουσιν | | 1.5723 | 121 | 24 | 3.85e-11 | 1.93e-08 |
| ἐστιν | | 1.5705 | 122 | 39 | 2.52e-06 | 0.000315 |
| φασὶν | | 1.5133 | 43 | 5 | 1.62e-06 | 0.00027 |
| λέγουσι | | 1.3411 | 55 | 10 | 4.55e-06 | 0.000455 |
| ἐστὶν | | 1.3015 | 55 | 20 | 0.00718 | 0.0876 |
| πρῶτον | | 1.2349 | 44 | 11 | 0.000673 | 0.0204 |
| δʼ | | 1.2317 | 32 | 4 | 0.00011 | 0.00551 |
| αὐτοῦ | | 1.2258 | 45 | 16 | 0.0127 | 0.115 |
| παῖδασ | | 1.2226 | 44 | 9 | 0.000118 | 0.0056 |
| βωμὸσ | | 1.2179 | 17 | 0 | 0.000196 | 0.00853 |
| ἔπη | | 1.1653 | 16 | 0 | 0.000374 | 0.0129 |
| διὰ | | 1.1627 | 48 | 19 | 0.0254 | 0.161 |
| ἱδρύσατο | | 1.1518 | 10 | 0 | 0.00694 | 0.0857 |
| ὅτι | | 1.1294 | 51 | 16 | 0.00236 | 0.0429 |
| ἔστι | | 1.0696 | 48 | 15 | 0.00309 | 0.0542 |
| αὐτήν | | 1.0636 | 10 | 2 | 0.139 | 0.395 |
| θυγάτηρ | | 1.0162 | 7 | 2 | 0.322 | 0.61 |
| αὐτῶι | | 1.0105 | 65 | 22 | 0.00144 | 0.0327 |
| ὄντα | | 0.9783 | 29 | 9 | 0.0212 | 0.157 |
| λέγεται | | 0.9651 | 39 | 13 | 0.0127 | 0.115 |
| λόγον | | 0.9611 | 16 | 3 | 0.0325 | 0.19 |
| ὄνομα | | 0.9496 | 53 | 14 | 0.000327 | 0.0126 |
| ἀρχαῖον | | 0.9395 | 18 | 3 | 0.0128 | 0.115 |
| καὶ ὅτι | | 0.9352 | 13 | 0 | 0.00115 | 0.0278 |
| λέγοντεσ | | 0.9172 | 17 | 1 | 0.00121 | 0.0282 |
| αὐτῆι | | 0.9077 | 21 | 5 | 0.0157 | 0.133 |
| φασι | | 0.8990 | 21 | 4 | 0.0126 | 0.115 |
| καὶ εἶναι | | 0.8918 | 12 | 0 | 0.00207 | 0.039 |
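The report does not say how the q-values were obtained, but the smallest entries are consistent with a Benjamini-Hochberg adjustment over the 1,000 features: 4.29e-14 × 1000 / 1 = 4.29e-11 for εἶναι, and 3.85e-11 × 1000 / 2 ≈ 1.93e-08 for λέγουσιν. The sketch below implements that procedure under this assumption. The function name is illustrative, and the 997 padding values of 1.0 are placeholders used only to bring the number of tests to 1,000; they are not values from the analysis.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg q-values in the same order as the input."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])  # indices by ascending p
    qvalues = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so that each q-value is capped by
    # the q-values of all larger p-values (this keeps q monotone in p).
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        qvalues[i] = running_min
    return qvalues


# The three smallest p-values in the table (εἶναι, λέγουσιν, φασιν).
reported = [4.29e-14, 3.85e-11, 2.53e-09]
# Placeholder padding so that m = 1000, matching the feature count.
pvalues = reported + [1.0] * 997

qvalues = benjamini_hochberg(pvalues)
for p, q in zip(reported, qvalues):
    print(f"p={p:.3g} -> q={q:.3g}")
```

The first three q-values computed this way agree with the table up to rounding. Rows further down depend on the ranks of features not listed here, so they cannot be reproduced from the table alone.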
Predictors of Historical Sentences
| Word/Phrase | English | Coefficient | Mythic Count | Historical Count | p-value | q-value |
|---|---|---|---|---|---|---|
| τούτουσ | | -1.5637 | 6 | 13 | 0.0168 | 0.136 |
| πόλεμον | | -1.5465 | 6 | 28 | 7.3e-07 | 0.000146 |
| πολέμωι | | -1.5365 | 4 | 22 | 7.39e-06 | 0.000672 |
| πολέμου | | -1.5144 | 7 | 24 | 3.78e-05 | 0.0023 |
| ἄνδρασ | | -1.4347 | 5 | 16 | 0.0011 | 0.0275 |
| οἰκίασ | | -1.3542 | 2 | 16 | 3.9e-05 | 0.0023 |
| καὶ ἐσ | | -1.2835 | 38 | 55 | 0.000426 | 0.014 |
| ἐναντία | | -1.2082 | 0 | 13 | 9.8e-06 | 0.000816 |
| στρατιᾶι | | -1.2080 | 1 | 14 | 3.72e-05 | 0.0023 |
| ναυσὶν | | -1.1911 | 3 | 22 | 3.13e-06 | 0.000348 |
| μὴ | | -1.1389 | 19 | 25 | 0.037 | 0.207 |
| βασιλέων | | -1.1204 | 0 | 12 | 2.39e-05 | 0.00171 |
| πολλά | | -1.1150 | 0 | 6 | 0.00494 | 0.0726 |
| ὅσον | | -1.1001 | 6 | 15 | 0.00508 | 0.0726 |
| αὐτὸσ | | -1.0561 | 19 | 34 | 0.00072 | 0.0212 |
| κόσμον | | -1.0367 | 0 | 7 | 0.00203 | 0.039 |
| σφῶν | | -1.0357 | 0 | 9 | 0.000344 | 0.0128 |
| ποτε | | -1.0251 | 12 | 14 | 0.197 | 0.472 |
| ἤδη | | -1.0224 | 23 | 31 | 0.0163 | 0.135 |
| χαλκοῦν | | -1.0075 | 2 | 9 | 0.0103 | 0.113 |
| καὶ ἔτι | | -0.9827 | 0 | 8 | 0.000837 | 0.0226 |
| ὃσ | | -0.9707 | 23 | 39 | 0.000534 | 0.0167 |
| οὐ | | -0.9587 | 86 | 76 | 0.137 | 0.395 |
| οὖν | | -0.9345 | 34 | 40 | 0.0252 | 0.161 |
| πρώτοισ | | -0.9340 | 3 | 6 | 0.175 | 0.423 |
| πλέον | | -0.9255 | 5 | 14 | 0.0041 | 0.0684 |
| ὅσοι | | -0.9242 | 8 | 16 | 0.0119 | 0.115 |
| κοινῶι | | -0.9161 | 2 | 7 | 0.0383 | 0.207 |
| ἐρείπια | | -0.9035 | 1 | 5 | 0.0878 | 0.302 |
| ἄλλα | | -0.8884 | 22 | 32 | 0.00737 | 0.0888 |