top of page
ABV_DiversityInclusion_1044x588.jpg

CS 6120 - Natural Language Processing 


Instructor: Malihe Alikhani
Time: Wednesday 6-9 pm

Office Hours: Wednesday 10-11 am

Course description

This course introduces students to the foundations of natural language processing (NLP), large language models (LLMs), and their applications. Through a mix of lectures, assignments, and presentations, students will gain both theoretical grounding and practical experience with modern NLP systems, from word embeddings to cutting-edge LLMs and their ethical considerations.

 

Schedule & Modules
 

Foundations (Weeks 1–3)

  • Introduction to NLP and word embeddings

  • Machine translation

  • Parsing, discourse, and coreference resolution

Large Language Models (Weeks 4–6)

  • Introduction to LLMs, architectures, and pre-training

  • Post-training methods and reinforcement learning with human feedback (RLHF)

  • Midterm presentations

  • Prompting methods: chain-of-thought, zero-shot reasoning, and more

Applications (Weeks 7–10)

  • Dialogue systems

  • Multimodality and robotics

  • Sign language processing

  • Open application topics

Ethics & Responsible AI (Weeks 11–12)

  • Social and ethical considerations in NLP and AI systems

  • Case studies and critical discussions

Final Projects (Weeks 13–14)

  • Wrap up and student final presentations
     

Key Dates
  • Jan 7: Course begins

  • Feb 11: Midterm presentations

  • April 22: Final presentations

Reference: Speech and Language Processing (3rd ed. draft)
Dan Jurafsky and James H. Martin 

 

Final research project

Final projects should ideally be done by teams of two to four students. Projects done by one or three students are possible on rare occasions with prior approval of the instructor. Each student is responsible for finding a partner for their final project. Feel free to post a message on Slack about your interests in order to find a partner with similar interests. ​

Submit a one-page project proposal by January 21 that briefly covers the questions shared in the slide dec with you. I will provide feedback on these proposals, but they will not be graded. Students are encouraged to discuss project proposals with me in office hours before submitting them. Groups are required to give a 15-minute presentation about their progress on the final project in the 6th week of the class. During the last week of class, each team will be required to give a 20-minute presentation on the current state of their project. This will allow for two project presentations per class. Use the same format as the paired research paper presentations (see above), but you may have only preliminary actual results at that time to present. Submit the code to the GitHub repo for the class at midnight on April 26. Final project reports are due on Canvas at midnight on April 26. They should be formatted to meet the requirements of paper submission to the ACL conference as described here.

Final grade

The final grade will be computed as follows:​

HW: 35%

  • HW0: 0% (optional)

  • HW1: 10% 

  • HW2: 10% 

  • HW3: 5% 

  • HW4: 10%

Midterm presentation: 15%

Paper presenation: 5%

In-class assignments: 10% (must be submitted to the TA within 24 hours)

Final Project: 30%​​

Participation: 5% (You can skip up to two classes. Please turn on the camera when you attend the class. )

Academic integrity
All assignment submissions must be the sole work of each individual student. Students may not read or copy another student's solutions or share their own solutions with other students. Students may not review solutions from students who have taken the course in previous years. Submissions that are substantively similar will be considered cheating by all students involved, and as such, students must be mindful not to post their code publicly. The use of books and online resources is allowed, but must be credited in submissions, and material may not be copied verbatim. Any use of electronics or other resources during an examination will be considered cheating.

If you have any doubts about whether a particular action may be construed as cheating, ask the instructor for clarification before you do it. The instructor will make the final determination of what is considered cheating.
Cheating in this course will result in a grade of F for the course and may be subject to further disciplinary action.

Using an open-source codebase is accepted, but you must explicitly cite the source, especially following the owner's guideline if it exists. For any writing involved in the project, plagiarism is strictly prohibited. If you are unclear whether your work will be considered as plagiarism, ask the instructor before submitting or presenting the work

Students with disabilities

If you have a disability for which you are or may be requesting an accommodation, you are encouraged to contact both your instructor and Disability Resources and Services as early as possible in the term. DRS will verify your disability and determine reasonable accommodations for this course. More info at https://drc.sites.northeastern.edu/.

Audio/video recordings

To ensure the free and open discussion of ideas, students may not record classroom lectures, discussions, and/or activities without the advance written permission of the instructor, and any such recording properly approved in advance can be used solely for the student's own private use. 

 

Copyrighted materials

All material provided through this web site is subject to copyright. This applies to class/recitation notes, slides, assignments, solutions, project descriptions, etc. You are allowed (and expected!) to use all the provided material for personal use. However, you are strictly prohibited from sharing the material with others in general and from posting the material on the Web or other file sharing venues in particular.


You have only one week after the grades are posted to request clarification or regrading. 

bottom of page