Friday, December 11, 2015

Friday, November 27, 2015

Muddiest Point For Week 13

What will be a good example for adaptive information retrieval system?

Reading Notes For Week 13

IIR

  • Most language-modeling work in IR has used unigram language models. IR is not the place where you most immediately need complex language models. Unigram models are often sufficient to judge the topic of a text.
  • Language modeling is a quite general formal approach to IR, with many variant realizations. The original and basic method for using language models in IR is the query likelihood model.
  • Vector space systems have generally preferred more lenient matching, though recent web search developments have tended more in the direction of doing searches with such conjunctive semantics. 
  • Group-average agglomerative clustering is avoiding the pitfalls of the single-link and complete-link criteria, which equate cluster similarity with the similarity of a single pair of documents.
  • Flat clustering creates a flat set of clusters without any explicit structure that would relate clusters to each other. Hierarchical clustering creates a hierarchy of clusters.
  •  The inverted index supports fast nearest-neighbor search for the standard IR setting. However, sometimes we may not be able to use an inverted index efficiently.
  • Feature selection makes training and applying a classifier more efficient by decreasing the size of the effective vocabulary.
  • Differential cluster labeling selects cluster labels by comparing the distribution of terms in one cluster with that of other clusters.

Friday, November 20, 2015

Reading Notes For Week 12

User Profiles

  1. classification: the way information is collected, the life period of the profile and structure.
  2. five basic approaches to user identification: software agents, logins, enhanced proxy serves, cookies and session ids.
  3. The searches is not limited to the Web, but they would also include databases to which the user has access, and the users personal documents. Such search systems are implemented in tools like Google Desktop Search.
  4. User identification can be obtained using mechanisms such as session ids or cookies that provide anonymity. Even methods requiring a login process can be anonymous if users are be allowed to use pseudonyms rather than their true identity. 
  5. In user customization, a recommendation system provides an interface that allows users to construct a representation of their own interests. Often check boxes are used to allow a user to select from the known values of attributes,
  6. Content-based recommendation systems recommend an item to a user based upon a description of the item and a profile of the user’s interests. While a user profile may be entered by the user, it is commonly learned from feedback the user provides on items. 
  7. Personalized Web search has emerged as one of the hottest topics for both the Web industry and academic researchers. 

Muddiest Point For Week 12

How to evaluate the quality of a web search?