Week 0
Week 1
Week 2
Week 3
Week 4
Week 5
Week 6
Week 7
Week 8
Week 9
Eval 1
Week 10
Week 11
Week 12
Eval 2
Week 13
Week 14
Week 15
Week 16
Week 17
Week 18
Week 19
Week 20
Week 21
Week 22
Eval 3
As for now, all the translation that can be done of Sumerian is that by Assyriologists, which are not that many, or by non-rule-based parsers who don’t do a good job at translating the sentences. This project aims to do the first step in the right direction to develop a rule-based morphological parser to ease the growth of further development in the study of the language of Sumer and Akkadian civilizations.
SFST is a finite-state transducer that will be used to write the morphologies for Sumerian. The analysis would be on three-level:
i. Morphological, turning the grammatical features into morphemes with morpheme boundaries and zero morphemes.
ii. Morphophonological, transcribing the morphemes to the conventional transcription (by removing morpheme boundaries).
iii. Transliteration, changing to make it orthographically accurate by the conventions of CDLI orthography.
–> Completed Tasks –> Ongoing Tasks
# | Status | Objectives | Associated Deliverables | issue(s) |
---|---|---|---|---|
1 | Create a script extracting the lexicon | Script analyses CDLI corpus and outputs in SuMor compatible format | ||
2 | Build a valid Sumerian lexicon | A tsv file with words, tags and meaning | ||
3 | Have a working parser | Changes to flexion.fst so that it can generate new morphologies in CDLI format | ||
4 | Write a whitepaper | Submit a paper to Christian Chiarcos, with stats | ||
5 | Polish the edges of SuMor | Change it to be seemless with one command and the magic happens, Add the leftout endings to the flexion file |
Week 0
Week 1
Week 2
Week 3
Week 4
Week 5
Week 6
Week 7
Week 8
Week 9
Eval 1
Week 10
Week 11
Week 12
Eval 2
Week 13
Week 14
Week 15
Week 16
Week 17
Week 18
Week 19
Week 20
Week 21
Week 22
Eval 3