# | Day | Date | A short description of the work done |
---|---|---|---|
1 | Monday | 2020/06/01 | Wrote code to extract data from the CDLI-conll files of the MTACC_GOLD corpus |
2 | Tuesday | 2020/06/02 | Extracted CDLI Sumerian data and wrote code to preprocess monolingual Sumerian text so that its tokenization is comparable to the CDLI-conll tokenization |
3 | Wednesday | 2020/06/03 | Prepared the POS tagging dataset for training and uploaded all the relevant data to GitHub |
4 | Thursday | 2020/06/04 | Worked on rules for the POS tagging system to be used in the CRF |
5 | Friday | 2020/06/05 | Read research papers and added rules |
6 | Saturday | 2020/06/06 | Used previous projects to analyse the language and add rules |
7 | Sunday | 2020/06/07 | Finished the dataset rules for POS tagging and committed them |
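
As a rough illustration of this week's two main steps (reading token/tag pairs from CoNLL-style files and turning hand-written rules into per-token features for a CRF), here is a minimal sketch. It is not the project's actual code: the column positions (`FORM_COL`, `POS_COL`), the function names `read_conll` and `token_features`, the feature names, and the two sample rows with their tags are all assumptions made up for this example.

```python
from typing import Dict, List, Tuple

# Assumed column layout for this sketch: ID, FORM, SEGM, XPOSTAG, ...
# Adjust these indices to match the real files.
FORM_COL = 1
POS_COL = 3


def read_conll(text: str) -> List[Tuple[str, str]]:
    """Return (form, pos) pairs from tab-separated CoNLL-style text."""
    pairs = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and comment/header lines starting with '#'.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) <= POS_COL:
            continue  # malformed row: not enough columns
        pairs.append((cols[FORM_COL], cols[POS_COL]))
    return pairs


def token_features(forms: List[str], i: int) -> Dict[str, object]:
    """Hypothetical feature rules for token i, as a plain feature dict."""
    form = forms[i]
    signs = form.split("-")  # transliterated signs are joined by hyphens
    return {
        "form": form,
        "first_sign": signs[0],
        "last_sign": signs[-1],
        "n_signs": len(signs),
        "has_determinative": "{" in form,  # e.g. {d} before divine names
        "prev_form": forms[i - 1] if i > 0 else "<BOS>",
        "next_form": forms[i + 1] if i < len(forms) - 1 else "<EOS>",
    }


# Invented sample rows, for illustration only.
SAMPLE = (
    "# ID\tFORM\tSEGM\tXPOSTAG\n"
    "o.1.1\tlugal-e\tlugal[king][-erg]\tN.ERG\n"
    "o.1.2\te2\te[house]\tN\n"
)

pairs = read_conll(SAMPLE)
forms = [form for form, _ in pairs]
features = [token_features(forms, i) for i in range(len(forms))]
```

Each sentence then becomes a list of feature dicts paired with a list of tags, which is the input shape that CRF toolkits commonly accept. Keeping the rules as plain dictionary entries makes it easy to add or drop a rule while comparing tagging results.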