Tagsets
Morphological Tags
MTAAC has adapted a set of 92 morphological tags (28 nominal tags, 57 finite verbal tags, 7 non-finite verbal tags) from the Electronic Text Corpus of Sumerial Royal Inscriptions Project (ETCSRI), with modifications. For the full list of these tags, please visit the following ETCSRI URL:
http://oracc.museum.upenn.edu/etcsri/parsing/index.html
The following nine morphological tags deviate from that found at ETCSRI and are specific to the MTAAC project:
ETCSRI | MTAAC | |
---|---|---|
Finite Verbal Stem | STEM | V |
Finite Plural Stem | STEM-PL | V.PL |
Finite Reduplicated Stem | STEM-RDP | V.RDP |
Non-finite Verbal Stem Abolsute | STEM.ABS | NF.V.ABS |
Non-finite Verbal Stem Preterite | STEM.SUB | NF.V.PT |
Non-finite Verbal Stem Present-Future | STEM.PF | NF.V.F |
Non-Finite Plural Stem | STEM-PL | NF.V.PL |
Non-Finite Reduplicated Stem | STEM-RDP | NF.V.RDP |
Non-Finite Present-Future Stem | STEM-PF | NF.V.PF |
Named Entities
ORACC: http://oracc.museum.upenn.edu/doc/help/languages/propernouns/index.html
ETCSL: http://etcsl.orinst.ox.ac.uk/edition2/etcslhelp.php#propernouns
Named Entities | |||
---|---|---|---|
ORACC | ETCSL | ||
DN | Divine Name | DN | divine name |
EN | Ethnos Name | EN | ethnic name |
GN | Geographical Name* | GN | geographical name |
MN | Month Name | MN | month name |
ON** | Object Name | ON** | other name |
PN | Personal Name | PN | personal name |
RN | Royal Name | RN | royal name |
SN | Settlement Name | SN | settlement name |
TN | Temple Name | TN | temple name |
WN | Watercourse Name | WN | watercourse name |
AN | Agricultural (locus) Name | ||
CN | Celestial Name | ||
FN | Field Name | ||
LN | Line Name (ancestral clan) | ||
QN | Quarter Name (city area) | ||
YN | Year Name |
- Lands and other geographical entities without their own tag.
** Note the different usages of the ON tag.
Part of Speech (POS) tags
ORACC: http://oracc.museum.upenn.edu/doc/help/languages/akkadian/index.html (scroll down to half the page)
ETCSL: http://etcsl.orinst.ox.ac.uk/edition2/etcslhelp.php#pos
UD: http://universaldependencies.org/docs/en/pos/all.html
Penn: https://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html
ORACC | ETCSL | UD | Penn Treebank POS | |||||
---|---|---|---|---|---|---|---|---|
AJ | adjective (including statives) | AJ | adjective | ADJ | adjective | JJ | Adjective | 7 |
JJR | Adjective, comparative | 8 | ||||||
JJS | Adjective, superlative | 9 | ||||||
AV | adverb | AV | adverb | ADV | RB | Adverb | 20 | |
RBR | Adverb, comparative | 21 | ||||||
RBS | Adverb, superlative | 22 | ||||||
NU | number | NU | numeral | NUM | Numeral | CD | Cardinal number | 2 |
CNJ | conjunction | C | conjunction | CONJ | coordinating conjunction | CC | Coordinating conjunction | 1 |
DET | determinative pronoun | PD | pronoun/determiner | DET | determiner | DT | Determiner | 3 |
EX | Existential there | 4 | ||||||
FW | Foreign word | 5 | ||||||
J | interjection | I | interjection | INTJ | interjection | UH | interjection | 26 |
LS | List item marker | 10 | ||||||
NNS | Noun, plural | 13 | ||||||
N | noun (including statives) | N | noun | NOUN | Noun | NN | Noun, singular or mass | 12 |
PART | particle | PRT | particle | (chunk tags) | ||||
PRP | Personal pronoun | 18 | ||||||
POS | Possessive ending | 17 | ||||||
PP | possessive pronoun | PRP$ | Possessive pronoun | 19 | ||||
WP$ | Possessive wh-pronoun | 35 | ||||||
PDT | Predeterminer | 16 | ||||||
SCONJ | subordinating conjunction | IN | Preposition or subordinating conjunction | 6 | ||||
NNPS | Proper noun, plural | 15 | ||||||
PROPN | Proper noun | NNP | Proper noun, singular | 14 | ||||
SYM | symbol | SYM | Symbol | 24 | ||||
TO | to | 25 | ||||||
VBZ | Verb, 3rd person singular present | 32 | ||||||
V | verb (including infinitives, marked with EPOS ‘N and gerundive GW) | V | verb | VERB | Verb | VB | Verb, base form | 27 |
VBG | Verb, gerund or present participle | 29 | ||||||
VBP | Verb, non-3rd person singular present | 31 | ||||||
VBN | Verb, past participle | 30 | ||||||
VBD | Verb, past tense | 28 | ||||||
WRB | Wh-adverb | 36 | ||||||
WDT | Wh-determiner | 33 | ||||||
WP | Wh-pronoun | 34 | ||||||
ADP | adposition | |||||||
AUX | auxiliary verb | |||||||
IP | independent/anaphoric pronoun | |||||||
DP | demonstrative pronoun | |||||||
MOD | modal, negative, or conditional particle | MD | Modal | 11 | ||||
PRP | preposition | |||||||
QP | interrogative pronoun | |||||||
RP | reflexive/reciprocal pronoun | |||||||
REL | relative pronoun | |||||||
SBJ | subjunction | |||||||
XP | indefinite pronoun | |||||||
NEG | negator | |||||||
PUNCT | punctuation | |||||||
X | Other |
Morphological tags
Morphological tags — comparative chart
Morphology chart under development
The ablative’s function of distributive is not distinguished in the two sets.
ETCSL Morphological tags
The ETCSL morphological tags mostly give a normalization that can then be treated with rules to disambiguate meaning. Exception is made for the verbal bases which are explicitly identified (aspect, plural, etc).
- Creating Tools for Morphological Analysis of Sumerian (link to the pdf)
Some examples:
POS | form | normalized |
---|---|---|
N | kur-kur-ra-gin7 | L,ak.gin |
N | cag4-gada-la2-be2-e-ne | L,bi.ene |
V | L,ani.ak | hul2-la-ni-a |
V | mu.ra.I.n:L | e-ri-in-bar-re-ec |
V | ga.V.n.ci:L | ga-an-ci-gen |
V | ha.ba.NI.n:L | ha-ba-ni-in-dug4 |
The tags seem only to appear on words that display affixes (the absolutive is not marked).
For a full description of the morphological annotation system, see the ETCSRI website.
The ETCSRI morphological tagging is very explicit and consists of three different attributes:
- The morpheme column based on logical table of possible suites of morphemes
- The usage of the morphemes
- The normalized form of the text
Attributes 2 and 3 are used in our CoNLL-U annotation in a hybrid attribute, eg:
iri-kug-ga-ka-ni
Original
N1=Irikug.N5=ak.N3=ani.N5=ø
N1=NAME.N5=GEN.N3=3-SG-H-POSS.N5=ABS
Irikug.ak.ani.ø
MTAAC modified version
Iriku=NAME.ak=GEN.ani=3-SG-H-POSS.ø=ABS
Missing affixes are added in the morphology, eg. nin-a-ni becomes
N1=nin.N3=ani.N5=ra
N1=STEM.N3=3-SG-H-POSS.N5=DAT-H
nin.ani.ra
or
nin=STEM.ani=3-SG-H-POSS.ra=DAT
The morphological tags are explicit and give information not only about the morpheme at hand but hint at the construction of the whole verbal chain. Eg:
morpheme | tag | meaning |
---|---|---|
enden | 1-PL-A | first person plural agent suffix |
enden | 1-PL-S | first person plural subject suffix (df) |
enden | 1-PL | first person plural suffix in plural transitive preterite verbal forms |
Or they give indications about subtleties in meaning:
morpheme | tag | meaning |
---|---|---|
e | DAT-NH | non-human dative case-marker |
e | L3-NH | non-human locative3 case-marker |
Émilie Pagé-Perron