SMU CS 5337/7337 Spring 2020 Preliminary Schedule

This page is maintained as the latest schedule of content and activities.

Date Topics Activity
Jan 21
Jan 23
Course overview, Introduction to IR (chpt 1) Hmwk 1 assigned, Reimagining Search
Jan 28
Jan 30
Boolean Retrieval, Terms, Posting Lists (chpt 2) Hmwk 2 assigned, Dark Web article, Mercator article
Feb 4
Feb 6
The Web and crawling (chpt 19, 20) Hmwk 1 due
Feb 11
Feb 13
Scoring (vector space model) (chpt 6) Hmwk 2 due, Hmwk 3 assigned, project 1 assigned, figure 6.12, brown cow, cow 1234 files, little lamb, textfiles
Feb 18
Feb 20
Wildcards and spelling errors (chpt 3) Levenshtein distance, soundex online
Feb 25
Feb 27
Indexing (chpt 4), Index Compression (chpt 5) Hmwk 3 due, Letter distribution, word morphing, Variable byte C program
Mar 3
Mar 5
Scoring (complete search model) (chpt 7) Hmwk 4 assigned, badfile
Mar 10
Mar 12
Catch up, review, midterm Exam I
Mar 17
Mar 19
No class: Spring Break Example of a noindex page, more text files
Mar 24
Mar 26
Evaluation in information retrieval (chpt 8) Hmwk 4 due, Project 1 due
Mar 31
Apr 2
Relevance feedback and query expansion (chpt 9) Hmwk 5 assigned, Project 2 assigned, co-occurrence matrix example
Apr 7
Apr 9
Text Classification (chpt 13) roadmap, SEO email, Bayes classifier, Poem and classifier, football-hockey example
Apr 14
Apr 16
Document Clustering (chpt 16, 17) Hmwk 5 due, Hmwk 6 assigned, Poem k-means and kNN, hierarchical demo
Apr 21
Apr 23
Link Analysis (chpt 21) The classics: Google and PageRank, HITS algorithm
Apr 28
Apr 30
Vector space classification (chpt 14) Hmwk 6 due
May 5 No class: Reading Day Project 2 due
May 7-9 Exam II, Take-home