Text Retrieval and Search Engines

Product type

Text Retrieval and Search Engines

Coursera (CC)
Logo Coursera (CC)
Provider rating: starstarstarstar_halfstar_border 7.2 Coursera (CC) has an average rating of 7.2 (out of 6 reviews)

Need more information? Get more details on the site of the provider.

Description

When you enroll for courses through Coursera you get to choose for a paid plan or for a free plan

  • Free plan: No certicification and/or audit only. You will have access to all course materials except graded items.
  • Paid plan: Commit to earning a Certificate—it's a trusted, shareable way to showcase your new skills.

About this course: Recent years have seen a dramatic growth of natural language text data, including web pages, news articles, scientific literature, emails, enterprise documents, and social media such as blog articles, forum posts, product reviews, and tweets. Text data are unique in that they are usually generated directly by humans rather than a computer system or sensors, and are thus especially valuable for discovering knowledge about people’s opinions and preferences, in addition to many other kinds of knowledge that we encode in text. This course will cover search engine technologies, which play an important role in any data mining applications involving text data for two reasons…

Read the complete description

Frequently asked questions

There are no frequently asked questions yet. If you have any more questions or need help, contact our customer service.

Didn't find what you were looking for? See also: Reading & Writing, Exam Preparation, Microsoft SQL Server, Microsoft Windows Server 2003/2008, and PL/SQL.

When you enroll for courses through Coursera you get to choose for a paid plan or for a free plan

  • Free plan: No certicification and/or audit only. You will have access to all course materials except graded items.
  • Paid plan: Commit to earning a Certificate—it's a trusted, shareable way to showcase your new skills.

About this course: Recent years have seen a dramatic growth of natural language text data, including web pages, news articles, scientific literature, emails, enterprise documents, and social media such as blog articles, forum posts, product reviews, and tweets. Text data are unique in that they are usually generated directly by humans rather than a computer system or sensors, and are thus especially valuable for discovering knowledge about people’s opinions and preferences, in addition to many other kinds of knowledge that we encode in text. This course will cover search engine technologies, which play an important role in any data mining applications involving text data for two reasons. First, while the raw data may be large for any particular problem, it is often a relatively small subset of the data that are relevant, and a search engine is an essential tool for quickly discovering a small subset of relevant text data in a large text collection. Second, search engines are needed to help analysts interpret any patterns discovered in the data by allowing them to examine the relevant original text data to make sense of any discovered pattern. You will learn the basic concepts, principles, and the major techniques in text retrieval, which is the underlying science of search engines.

Created by:  University of Illinois at Urbana-Champaign
  • Taught by:  ChengXiang Zhai, Professor

    Department of Computer Science
Basic Info Course 2 of 6 in the Data Mining Specialization Language English How To Pass Pass all graded assignments to complete the course. User Ratings 4.4 stars Average User Rating 4.4See what learners said Coursework

Each course is like an interactive textbook, featuring pre-recorded videos, quizzes and projects.

Help from your peers

Connect with thousands of other learners and debate ideas, discuss course material, and get help mastering concepts.

Certificates

Earn official recognition for your work, and share your success with friends, colleagues, and employers.

University of Illinois at Urbana-Champaign The University of Illinois at Urbana-Champaign is a world leader in research, teaching and public engagement, distinguished by the breadth of its programs, broad academic excellence, and internationally renowned faculty and alumni. Illinois serves the world by creating knowledge, preparing students for lives of impact, and finding solutions to critical societal needs.

Syllabus


WEEK 1


Orientation
You will become familiar with the course, your classmates, and our learning environment. The orientation will also help you obtain the technical skills required for the course.


2 videos, 6 readings, 1 practice quiz expand


  1. Reading: Welcome to Text Retrieval and Search Engines!
  2. Reading: Syllabus
  3. Reading: About the Discussion Forums
  4. Reading: Updating your Profile
  5. Reading: Social Media
  6. Reading: Course Errata
  7. Video: Course Welcome Video
  8. Video: Course Introduction Video
  9. Practice Quiz: Pre-Quiz

Graded: Orientation Quiz

Week 1
During this week's lessons, you will learn of natural language processing techniques, which are the foundation for all kinds of text-processing applications, the concept of a retrieval model, and the basic idea of the vector space model.


6 videos, 1 reading, 1 practice quiz expand


  1. Reading: Week 1 Overview
  2. Video: Lesson 1.1: Natural Language Content Analysis
  3. Video: Lesson 1.2: Text Access
  4. Video: Lesson 1.3: Text Retrieval Problem
  5. Video: Lesson 1.4: Overview of Text Retrieval Methods
  6. Video: Lesson 1.5: Vector Space Model - Basic Idea
  7. Video: Lesson 1.6: Vector Space Retrieval Model - Simplest Instantiation
  8. Practice Quiz: Week 1 Practice Quiz

Graded: Week 1 Quiz

WEEK 2


Week 2



In this week's lessons, you will learn how the vector space model works in detail, the major heuristics used in designing a retrieval function for ranking documents with respect to a query, and how to implement an information retrieval system (i.e., a search engine), including how to build an inverted index and how to score documents quickly for a query.


6 videos, 1 reading, 1 practice quiz expand


  1. Reading: Week 2 Overview
  2. Video: Lesson 2.1: Vector Space Model - Improved Instantiation
  3. Video: Lesson 2.2: TF Transformation
  4. Video: Lesson 2.3: Doc Length Normalization
  5. Video: Lesson 2.4: Implementation of TR Systems
  6. Video: Lesson 2.5: System Implementation - Inverted Index Construction
  7. Video: Lesson 2.6: System Implementation - Fast Search
  8. Practice Quiz: Week 2 Practice Quiz

Graded: Week 2 Quiz

WEEK 3


Week 3



In this week's lessons, you will learn how to evaluate an information retrieval system (a search engine), including the basic measures for evaluating a set of retrieved results and the major measures for evaluating a ranked list, including the average precision (AP) and the normalized discounted cumulative gain (nDCG), and practical issues in evaluation, including statistical significance testing and pooling.


6 videos, 2 readings, 1 practice quiz expand


  1. Reading: Week 3 Overview
  2. Video: Lesson 3.1: Evaluation of TR Systems
  3. Video: Lesson 3.2: Evaluation of TR Systems - Basic Measures
  4. Video: Lesson 3.3: Evaluation of TR Systems - Evaluating Ranked Lists - Part 1
  5. Video: Lesson 3.4: Evaluation of TR Systems - Evaluating Ranked Lists - Part 2
  6. Video: Lesson 3.5: Evaluation of TR Systems - Multi-Level Judgements
  7. Video: Lesson 3.6: Evaluation of TR Systems - Practical Issues
  8. Practice Quiz: Week 3 Practice Quiz
  9. Reading: Programming Assignments Overview

Graded: Week 3 Quiz
Graded: Programming Assignment 1

WEEK 4


Week 4



In this week's lessons, you will learn probabilistic retrieval models and statistical language models, particularly the detail of the query likelihood retrieval function with two specific smoothing methods, and how the query likelihood retrieval function is connected with the retrieval heuristics used in the vector space model.


7 videos, 1 reading, 1 practice quiz expand


  1. Reading: Week 4 Overview
  2. Video: Lesson 4.1: Probabilistic Retrieval Model - Basic Idea
  3. Video: Lesson 4.2: Statistical Language Model
  4. Video: Lesson 4.3: Query Likelihood Retrieval Function
  5. Video: Lesson 4.4: Statistical Language Model - Part 1
  6. Video: Lesson 4.5: Statistical Language Model - Part 2
  7. Video: Lesson 4.6: Smoothing Methods - Part 1
  8. Video: Lesson 4.7: Smoothing Methods - Part 2
  9. Practice Quiz: Week 4 Practice Quiz

Graded: Week 4 Quiz

WEEK 5


Week 5



In this week's lessons, you will learn feedback techniques in information retrieval, including the Rocchio feedback method for the vector space model, and a mixture model for feedback with language models. You will also learn how web search engines work, including web crawling, web indexing, and how links between web pages can be leveraged to score web pages.


8 videos, 1 reading, 1 practice quiz expand


  1. Reading: Week 5 Overview
  2. Video: Lesson 5.1: Feedback in Text Retrieval
  3. Video: Lesson 5.2: Feedback in Vector Space Model - Rocchio
  4. Video: Lesson 5.3: Feedback in Text Retrieval - Feedback in LM
  5. Video: Lesson 5.4: Web Search: Introduction & Web Crawler
  6. Video: Lesson 5.5: Web Indexing
  7. Video: Lesson 5.6: Link Analysis - Part 1
  8. Video: Lesson 5.7: Link Analysis - Part 2
  9. Video: Lesson 5.8: Link Analysis - Part 3
  10. Practice Quiz: Week 5 Practice Quiz

Graded: Week 5 Quiz

WEEK 6


Week 6



In this week's lessons, you will learn how machine learning can be used to combine multiple scoring factors to optimize ranking of documents in web search (i.e., learning to rank), and learn techniques used in recommender systems (also called filtering systems), including content-based recommendation/filtering and collaborative filtering. You will also have a chance to review the entire course.


10 videos, 1 reading, 1 practice quiz expand


  1. Reading: Week 6 Overview
  2. Video: Lesson 6.1: Learning to Rank - Part 1
  3. Video: Lesson 6.2: Learning to Rank - Part 2
  4. Video: Lesson 6.3: Learning to Rank - Part 3
  5. Video: Lesson 6.4: Future of Web Search
  6. Video: Lesson 6.5: Recommender Systems: Content-Based Filtering - Part 1
  7. Video: Lesson 6.6: Recommender Systems: Content-Based Filtering - Part 2
  8. Video: Lesson 6.7: Recommender Systems: Collaborative Filtering - Part 1
  9. Video: Lesson 6.8: Recommender Systems: Collaborative Filtering - Part 2
  10. Video: Lesson 6.9: Recommender Systems: Collaborative Filtering - Part 3
  11. Video: Lesson 6.10: Course Summary
  12. Practice Quiz: Week 6 Practice Quiz

Graded: Week 6 Quiz
Graded: Programming Assignment 2
There are no reviews yet.

    Share your review

    Do you have experience with this course? Submit your review and help other people make the right choice. As a thank you for your effort we will donate £1.- to Stichting Edukans.

    There are no frequently asked questions yet. If you have any more questions or need help, contact our customer service.