This page is maintained as the latest schedule of content and activities.
Date | Topics | Activity |
Jan 21, Jan 23 | Course overview, Introduction to IR (chpt 1) | Hmwk 1 assigned, Reimagining Search |
Jan 28, Jan 30 | Boolean Retrieval, Terms, Posting Lists (chpt 2) | Hmwk 2 assigned, Dark Web article, Mercator article |
Feb 4, Feb 6 | The Web and crawling (chpt 19, 20) | Hmwk 1 due |
Feb 11, Feb 13 | Scoring (vector space model) (chpt 6) | Hmwk 2 due, Hmwk 3 assigned, Project 1 assigned, figure 6.12, brown cow, cow 1234 files, little lamb, textfiles |
Feb 18, Feb 20 | Wildcards and spelling errors (chpt 3) | Levenshtein distance, soundex online |
Feb 25, Feb 27 | Indexing (chpt 4), Index Compression (chpt 5) | Hmwk 3 due, Letter distribution, word morphing, Variable byte C program |
Mar 3, Mar 5 | Scoring (complete search model) (chpt 7) | Hmwk 4 assigned, badfile |
Mar 10, Mar 12 | Catch up, review, midterm | Exam I |
Mar 19 | No class: Spring Break | Example of a noindex page, more text files |
Mar 24, Mar 26 | Evaluation in information retrieval (chpt 8) | Hmwk 4 due, Project 1 due |
Mar 31, Apr 2 | Relevance feedback and query expansion (chpt 9) | Hmwk 5 assigned, Project 2 assigned, co-occurrence matrix example |
Apr 7, Apr 9 | Text Classification (chpt 13) | roadmap, SEO email, Bayes classifier, Poem and classifier, football-hockey example |
Apr 14, Apr 16 | Document Clustering (chpt 16, 17) | Hmwk 5 due, Hmwk 6 assigned, Poem k-means and kNN, hierarchical demo |
Apr 21, Apr 23 | Link Analysis (chpt 21) | The classics: Google and PageRank, HITS algorithm |
Apr 28, Apr 30 | Vector space classification (chpt 14) | Hmwk 6 due |
No class: Reading Day | Project 2 due | |
May 7-9 | Exam II, Take-home |