| Lectures: | Tuesday and Thursday, 2:00-3:20, EMCB 120 |
| Textbook: | Speech and Language Processing: An Introduction to Natural Language |
| Processing, Computational Linguistics, and Speech Recognition | |
| by Daniel Jurafsky and James Martin | |
| Instructor: | Professor Ellen Riloff |
| E-mail: | teach-cs5340@cs.utah.edu (preferred) or riloff@cs.utah.edu |
| Office: | 3140 MEB |
| Consulting Hours: | anytime when my door is open, and by appointment |
| TAs: | Bryce Anderson, Sean Igo |
| E-mail: | teach-cs5340@cs.utah.edu (preferred) |
| or andersbr@cs.utah,edu, sigo@cs.utah.edu | |
| Consulting Hours: | by appointment |
| Date | Topic | Reading |
|---|---|---|
| AUG 24 | Introduction to and history of NLP | Ch. 1, 2 |
| AUG 29 | Syntax: the basics | Ch. 3.1-3.2 (pp. 57-70), 3.4-3.6 |
| Ch. 8.1-8.4 | ||
| AUG 31 | Syntax: chart parsing | Ch. 10.4 |
| SEP 5 | Syntax: transition network parsing | Ch. 9.1-9.8, 9.10-9.13 |
| Ch. 10.1-10.3, 10.5-10.6 | ||
| SEP 7 | Syntax: parsing cont'd | |
|
SEP 12 |
Probability; N-gram models | Ch. 6.1-6.3 (pp. 191-210), 8.5 |
| SEP 14 | Probabilistic algorithms | Ch. 5.9 (pp. 169-179) |
| SEP 19 | Probabilistic algorithms (cont'd) | |
| SEP 21 | Probabilistic context-free grammars | Ch. 12.1-12.3 |
| SEP 26 | Experimental design | pp. 315 |
| SEP 28 | Information extraction | Ch. 15.5 |
| OCT 3 | Semantic lexicon bootstrapping | |
| OCT 5 | FALL BREAK | |
| OCT 10 | Semantics: the basics | Ch. 14, 16 |
| OCT 12 | Semantics: the basic (cont'd) | |
| OCT 17 | MIDTERM EXAM | |
| OCT 19 | Learning extraction patterns | Handout |
| OCT 24 | Semantics: named entity recognition | |
| OCT 26 | Discourse | Ch. 18.1 |
| OCT 31 | Semantics: word sense disambiguation | Ch. 17.1-17.2 |
| NOV 2 | Transformation-based learning | Ch. 8.6 |
| NOV 7 | Conceptual dependency theory | Handout |
| NOV 9 | Applications: information retrieval and Q/A | Ch. 17.3-17.5 |
| NOV 14 | Applications: speech understanding | Ch. 7.1-7.3, 9.9 |
| NOV 16 | Applications: machine translation | Ch. 21 |
| NOV 21 | Summarization (guest lecture by
|
|
| NOV 23 | THANKSGIVING BREAK | |
| NOV 28 | Project presentations | |
| NOV 30 | Project presentations | |
| DEC 5 | Project presentations | |
| DEC 7 | Project presentations | |
| DEC 11 | FINAL EXAM 1:00-3:00 in EMCB 120 |
Grades for the course will be based on homework assignments (both written questions and programs), a project, a midterm exam, and a final exam. Overall grades will be based on the following formula:
Homework assignments must be submitted using the electronic handin system, including BOTH written and programming assignments. The written assignments must be submitted in one of the following formats: pdf, postscript, html (without hyperlinks), or plain ascii text. Instructions for using the electronic handin system will accompany each assignment. Assignments slipped under a door or left in the main office will not be accepted.
Programs may use any of the following programming languages: C++, Java, or python. You must get permission from the instructor if you wish to use any other programming language. All programs must compile and run on the linux-based CADE machines (lab1 and lab4 machines). You will receive ZERO credit for programs that do not compile and run on the linux-based CADE machines. This policy is necessary for the instructor and TAs to test and grade your programs ourselves.
Late assignments will not be accepted. The electronic submission system will be disabled promptly after the time specified as the due date for each assignment. No assignments will be accepted thereafter. Our motivation for this strict late policy is to ensure that assignments are graded and returned as quickly as possible. Allowing late assignments holds up the grading process and the distribution of solutions. We will do our best to grade the assignments and hand out solutions promptly so that everyone will get feedback in a timely fashion.
Cheating of any kind will not be tolerated. Any assignment or exam that is handed in must be your own work!! However, talking with one another to understand the material better is strongly encouraged. Recognizing the distinction between cheating and cooperation is very important.
The University of Utah seeks to provide equal access to its programs, services and activities for people with disabilities. If you will need accommodations in the class, reasonable prior notice needs to be given to the Center for Disability Services, 162 Olpin Union Building, 581-5020 (V/TDD). CDS will work with you and the instructor to make arrangements for accommodations.
All written information in this course can be made available in alternative format with prior notification to the Center for Disability Services.