Course pages 2019–20
Natural Language Processing
Assessment is by coursework as follows:
- Assignment 1, for 20% of the overall grade. Write a 500 word report of your experiment of SVM-based sentiment classification. Ticked, i.e., Pass/Fail.
- Assignment 2, for 40% of the overall grade. Write a 1000 word
report of your experiment of Doc2Vec-based sentiment classification.
- Assignment 3, for 40% of the overall grade. Write a 1000 word report of your design for a text understanding question answering system.
Deadlines:
- Assignment 1: 8 November, 4pm
- Assignment 2: 6 December, 4pm (changed from 29 November due to Industrial Action)
- Assignment 3: 6 December, 4pm
Data etc for Practical
- Here is the paper you will replicate (some aspects of): Pang et al. (2002)
- TOKENIZED DATA: NEG-token.tar and POS-token.tar
- Here is some explanation from Siegel and Castellan (1988) about sign test (pdf).
- Here are the MLRD slides on crossvalidation
- Assignment3, Text 1 in pdf, in ASCII plain text and the output of the Stanford parser on text 1
- Assignment3, Text 2 in png with its Questions, in ASCII plain text and the output of the Stanford parser on text 2
- If you send email to me (sht25), make sure all demonstrators are cc'ed in -- Guy (ga384), Gladys (whgt2), Josef (jv406), Yiwen (yc429).