Learning Outcomes
Upon successful completion of the course, students should:
understand the basic concepts of Computational Linguistics,
become familiar with techniques and methods of natural language processing,
learn about the latest advances in the area (i.e., Large Language Models),
become aware of the ethical and legal issues
understand the Architecture of mainstream Natural Language Processing tools in order to use them efficiently,
be able to participate in the design, development and evaluation of Natural Language Processing tools,
be able able to participate in the design, development, and evaluation of datasets following annotation guidelines and standards to ensure reproducibility of their effort.
Course Content (Syllabus)
This is an introductory course in the inter-disciplinary field of Computational Linguistics and Natural Language Processing. The course is structured as follows:
1. Language Technology, Computational Linguistics, and Natural Language Processing (NLP): Basic concepts and principles.
2. A brief historical overview and significant milestones. Methods in NLP: from rule-based systems to probabilistic approaches, Large Language Models, Generative Artificial Intelligence (AI), and Human-centered AI.
3. Computational Linguistics and linguistic theory: Levels of linguistic analysis. Basic pre-processing: Part-of-Speech taggong and lemmatization.
4. Sentence structure: syntactic analysis (Context-Free Grammars, Dependency Parsing).
5. Approaches to semantics: Lexical semantics. Argument structure and semantic role role labeling. Abstract Meaning Representation.
6. Computational Semantics: Inference, paraphrasing, simplification.
7. Coreference resolution.
8. Language resources: Corpora, lexica, and tools. Corpus annotation (principles, methods, and tools).
9. Applications: Machine Translation. Information extraction: identification of events and entities, entity linking.
10. Applications: Sentiment analysis, offensive language identification. Irony, humor, and argumentation.
11. Computational Linguistics: Legal and ethical issues.
12. Presentations.
Keywords
Natural Language Processing, Natural Language Understanding, (Generative) Artificial Intelligence, Language Resources, Corpus annotation
Additional bibliography for study
Γιούλη, Π. (2024) Υπολογιστική Γλωσσολογία: Μέθοδοι και εφαρμογές. Από τις συμβολικές προσεγγίσεις στη Μηχανική Μάθηση και την Παραγωγική Τεχνητή Νοημοσύνη. Σημειώσεις μαθήματος.
Παναγιωτακόπουλος, Χ., Τσαλίδης, Χ., Γάκης, Π. και Κόκκινος, Θ. (2022). Υπολογιστική γλωσσολογία. Από τον προγραμματισμό μέχρι τη διδακτική πράξη. [Προπτυχιακό εγχειρίδιο]. Κάλλιπος, Ανοικτές Ακαδημαϊκές Εκδόσεις. https://dx.doi.org/10.57713/kallipos-127
Τάντος, Α., Μαρκαντωνάτου, Σ., Αναστασιάδη-Συμεωνίδη, Ά., Κυριακοπούλου, Π. (2015). Υπολογιστική γλωσσολογία. [ηλεκτρ. βιβλ.] Αθήνα: Σύνδεσμος Ελληνικών Ακαδημαϊκών Βιβλιοθηκών. http://hdl.handle.net/11419/2205 https://repository.kallipos.gr/handle/11419/2205
Jurafsky, D. and Martin, J. H. (2020-2021) Speech and Language Processing (3rd ed. draft) https://web.stanford.edu/~jurafsky/slp3/
Jacob Eisenstein. (2019). Natural Language Processing. MIT Press. https://cseweb.ucsd.edu/~nnakashole/teaching/eisenstein-nov18.pdf
Bender, Emily M. and Alex Lascarides. (2019). Linguistic Fundamentals for Natural Language Processing II: 100 Essentials from Semantics and Pragmatics. Synthesis Lectures on Human Language Technologies. Morgan & Claypool Publishers
Goldberg, Y. (2015). A Primer on Neural Network Models for Natural Language Processing. ArXiv, abs/1510.00726. https://www.semanticscholar.org/paper/A-Primer-on-Neural-Network-Models-for-Natural-Goldberg/56edaa1368ff4dfa45388e4be24fdfbded7d88a7