Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
seminar surveyer
Active In SP

Posts: 3,541
Joined: Sep 2010
27-01-2011, 02:38 PM

.pptx   Information Retrival.pptx (Size: 232.04 KB / Downloads: 91)

Sudheer reddy . B


Performance Measures
What IR Do-How ?
Traditional View of IR

History :

The idea of using computers to search for relevant pieces of information was popularized by the article “As We May Think” by Vannevar Bush in 1945.
The first automated information retrieval systems were introduced in the 1950s and 1960s.
In 1992, the US Department of Defense along with the NIST cosponsored the Text Retrieval Conference(TREC) program-Web Search Engines.

Overview :

An information retrieval process begins when a user enters a Query into the system.
Process may then be iterated if the user wishes to refine the query.

What IR Systems Try to Do ?

Predict, on the basis of some information about the user, and information about the knowledge resource, what information objects are likely to be the most appropriate for the user to interact with, at any particular time.

How IR Systems Try to Do This

Represent the user’s information problem (the query)
Represent (surrogate) and organize (classify) the contents of the knowledge resource
Compare query to surrogates (predict relevance)
Present results to the user for interaction/judgment

Performance measures :

Traditional goal of IR is to retrieve all and only the relevant IOs in response to a query.
All is measured by recall: the proportion of relevant IOs in the collection which are retrieved
Only is measured by precision: the proportion of retrieved IOs which are relevant

seminar class
Active In SP

Posts: 5,361
Joined: Feb 2011
11-03-2011, 02:21 PM

.ppt   ch19.ppt (Size: 258 KB / Downloads: 42)
Information Retrieval Systems
n Information retrieval (IR) systems use a simpler data model than database systems
l Information organized as a collection of documents
l Documents are unstructured, no schema
n Information retrieval locates relevant documents, on the basis of user input such as keywords or example documents
l e.g., find documents containing the words “database systems”
n Can be used even on textual descriptions provided with non-textual data such as images
n Web search engines are the most familiar example of IR systems
n Differences from database systems
l IR systems don’t deal with transactional updates (including concurrency control and recovery)
l Database systems deal with structured data, with schemas that define the data organization
l IR systems deal with some querying issues not generally addressed by database systems
n Approximate searching by keywords
n Ranking of retrieved answers by estimated degree of relevance
Keyword Search
n In full text retrieval, all the words in each document are considered to be keywords.
l We use the word term to refer to the words in a document
n Information-retrieval systems typically allow query expressions formed using keywords and the logical connectives and, or, and not
l Ands are implicit, even if not explicitly specified
n Ranking of documents on the basis of estimated relevance to a query is critical
l Relevance ranking is based on factors such as
 Term frequency
– Frequency of occurrence of query keyword in document
 Inverse document frequency
– How many documents the query keyword occurs in
» Fewer è give more importance to keyword
 Hyperlinks to documents
– More links to a document è document is more important
Relevance Ranking Using Terms
n TF-IDF (Term frequency/Inverse Document frequency) ranking:
l Let n(d) = number of terms in the document d
l n(d, t) = number of occurrences of term t in the document d.
l Relevance of a document d to a term t
 The log factor is to avoid excessive weight to frequent terms
Relevance of document to query Q
n Most systems add to the above model
l Words that occur in title, author list, section headings, etc. are given greater importance
l Words whose first occurrence is late in the document are given lower importance
l Very common words such as “a”, “an”, “the”, “it” etc are eliminated
 Called stop words
l Proximity: if keywords in query occur close together in the document, the document has higher importance than if they occur far apart
n Documents are returned in decreasing order of relevance score
l Usually only top few documents are returned, not all
Similarity Based Retrieval
n Similarity based retrieval - retrieve documents similar to a given document
l Similarity may be defined on the basis of common words
 E.g. find k terms in A with highest TF (d, t ) / n (t ) and use these terms to find relevance of other documents.
n Relevance feedback: Similarity can be used to refine answer set to keyword query
l User selects a few relevant documents from those retrieved by keyword query, and system finds other documents similar to these
n Vector space model: define an n-dimensional space, where n is the number of words in the document set.
l Vector for document d goes from origin to a point whose i th coordinate is TF (d,t ) / n (t )
l The cosine of the angle between the vectors of two documents is used as a measure of their similarity.

Important Note..!

If you are not satisfied with above reply ,..Please


So that we will collect data for you and will made reply to the request....OR try below "QUICK REPLY" box to add a reply to this page

Quick Reply
Type your reply to this message here.

Image Verification
Please enter the text contained within the image into the text box below it. This process is used to prevent automated spam bots.
Image Verification
(case insensitive)

Possibly Related Threads...
Thread Author Replies Views Last Post
  latest seminar topics for information technology jaseelati 0 375 23-02-2015, 03:17 PM
Last Post: jaseelati
  seminar topics for information technology with ppt jaseelati 0 341 21-02-2015, 04:16 PM
Last Post: jaseelati
  information technology seminar topics jaseelati 0 334 07-02-2015, 12:58 PM
Last Post: jaseelati
  pictorial presentation and information about a mall jaseelati 0 373 29-11-2014, 01:09 PM
Last Post: jaseelati
  Information Theoretic Framework of Trust Modeling and Evaluation for Ad Hoc Net seminar projects maker 0 405 30-09-2013, 04:22 PM
Last Post: seminar projects maker
  Report on Management Information Systems seminar projects maker 0 537 14-09-2013, 04:26 PM
Last Post: seminar projects maker
  INFORMATION SECURITY project topics 3 2,408 13-09-2013, 09:29 AM
Last Post: seminar projects maker
  COUNTER MEASURES: INFORMATION WARFARE PPT seminar projects maker 0 311 11-09-2013, 02:32 PM
Last Post: seminar projects maker
  Wireless Information-Theoretic Security pdf study tips 0 350 28-08-2013, 03:49 PM
Last Post: study tips
  Efficient SCADA Module for Improving Medical Information Monitoring and Reliable pdf study tips 0 357 22-08-2013, 03:04 PM
Last Post: study tips