StudentShare solutions
Triangle menu

Scientific Information Retrieval - Essay Example

Nobody downloaded yet

Extract of sample
Scientific Information Retrieval

The common search or Boolean query that computer users do everyday is a submission of a term to search engine which is programmed with a Boolean algorithm which finds documents with the term we included in the search and it is supported by an index containing all terms in the database. The simple form of Boolean query,
which is efficiently implemented over large databases, suffers several limitations: The number of retrieved documents is typically prohibitively large. A substantial part of the retrieved documents is irrelevant to the user's information need.
A broadly used alternative to the Boolean query is the similarity query, which is typically based on the vector-space model. Under this setting, documents are viewed as (algebraic) vectors over terms. A query, q, may consist of many terms, and even comprise a complete document. It too is viewed as a body of text, rather than merely as a search-terms combination and is represented as a vector as well. The retrieval task reduces to searching the database for document-vectors that are most similar to the query-vector. Other approaches based on the vector-space model also aim to reduce the dependency of the retrieved documents on the particular choice of query terms, and effectively improve retrieval. One way to do this is through the reweighting of query terms, where terms occurring within relevant documents
receive a higher weight than those occurring in irrelevant ones. This process is called relevance feedback.

1c. Text categorization
A task often addressed by information retrieval systems is that of text categorization. This is the labeling of text by category-tags from a predefined set of categories. There are two approaches to text categorization: The first is the knowledge-engineer approach, a set of rules are encoded to determine the categorization of the data base by an engineer who consults with an expert with knowledge of the information in the data base and makes the rules. This method has a fallback in that the rules must be continually revised to keep up with information in the data and results in what is know as the knowledge engineering bottleneck. The other is the machine learning (ML) approach, where a text classifier is viewed as a function learnt by an inductive process, from a training set of example documents, already classified into a predefined set of categories. ML-based classification is partitioned into two types: hard and soft classification. Under hard classification a document is strictly assigned to a single category. In contrast, soft classification entails a ranking by relevance of the categories for each document. Under this approach, the classifier returns a number between 0 and 1 (called the categorization status value, CSV).1

History of the internet

Just briefly touching on this subject, a series of memos were written by J.C.R. Licklider of MIT who envisioned what he called the "Galactic Net" Leonard Kleinrock MIT published the first paper on packet switching ...Show more

Summary

This paper will attempt to explain how new computer information retrieval (IR) methods have become an essential tool for the scientific community. The vast body of data created by the sciences has created a problem, namely how can this data be created to usable knowledge Data is distributed worldwide, and works from various universities and institutes are published and posted weekly, daily, even hourly in thousands of journals and reports…
Author : beahanantwan
Scientific Information Retrieval essay example
Read Text Preview
Save Your Time for More Important Things
Let us write or edit the essay on your topic
"Scientific Information Retrieval"
with a personal 20% discount.
Grab the best paper

Check these samples - they also fit your topic

Retrieval Medicine for Critical care Paramedic
Triage resource allocation and high level clinical oversight are key elements to this process. The following incidents need to be considered as being independent of each other. In each case the decision to mobilize, coordinate and support a retrieval team rests solely with you.
20 pages (5000 words) Essay
Retrieval medicine
Police officers can be assigned to secure the perimeter and they are expected to implement crowd control while the medical team is responding to the victims of the accident. Fire and rescue team will be assigned to check on the safety of the exact accident area.
16 pages (4000 words) Essay
Media Communication of Scientific and Environmental Information
The media should therefore have the capacity to present accurate and dependable information without bias whatsoever. However, there is the debate on how news should be disseminated and whether journalists present scientific findings appropriately. When reporting, the media usually assume a general audience.
4 pages (1000 words) Essay
Information System
Internet and computers have facilitated the change in means of communication among people of different people, whereby communication and accessibility of information is effective. Therefore, people are able to communicate and share their ideas from different parts of the world, hence creating an opportunity for people knowing about other countries.
8 pages (2000 words) Essay
Information Management Master Essay
To do this, organizations should organize and carry out a comprehensive records management program. Records should have the characteristics of Authenticity, Reliability, Integrity and Usability. To recover the losses of flood, there should have some changes like- documenting records transactions, physical storage medium and protection, distributed management, conversion and migration, access; retrieval and use, retention and disposition3.
15 pages (3750 words) Essay
Information Retrieval & Knowledge Management
In Leader's Change Handbook, Conger, Spreitzer, and Lawler (1999, p.361), suggests that there is an emerging consensus and advocacy for organizations to be "excellent at knowledge creation and management." The authors write, "effective organizations need to grow not just as individuals but their own intellectual capital and property and their ability to deploy them effectively".
8 pages (2000 words) Essay
Coursework for information retrieval knowledge management course
In Leaders Change Handbook, Conger, Spreitzer, and Lawler (1999, p.361), suggests that there is an emerging consensus and advocacy for organizations to be "excellent at
8 pages (2000 words) Essay
Information Technology assignment unit 4
So here the selections for the system configurations for the concerning firm are: Here I have chosen a printer that can print 1200 dots per inch so this provides very
2 pages (500 words) Essay
GEOGRAPHIC INFORMATION SYSTEMS (GIS)
Various GIS software has been developed to help in modeling landscapes and allocation of various factors of development through spatial organization. As a specialist in GIS, I can use the application in many different but beneficial ways
4 pages (1000 words) Essay
Who gets access to the information and technologies that (scientific) research makes possible
Scientific research emanating from information and technologies comes with unexpected benefits, costs and risks that fall on diverse social groups at diverse times (Kaplan, 2004, 487-489). Thus, effects of technology are as significant as developing its proficiencies and their accessibility are limited to specific personalities and research institutions.
1 pages (250 words) Essay
Hire a pro to write
a paper under your requirements!
Win a special DISCOUNT!
Put in your e-mail and click the button with your lucky finger
Your email
YOUR PRIZE:
Apply my DISCOUNT
Comments (0)
Click to create a comment