Information-searching activity is one of the fundamental aspects of our modern lives. The Internet allows us to enrich knowledge by accessing online resources such as text documents, images, sound and video files and geographic information.
There are 3 different points of view for information retrieval (IR). The first one is a view from users. Users search information in order to enrich their inadequate knowledge. Another perspective is from system. The IR system allows us to store, manipulate and retrieve information. The last one is a view from sources which is related to presentation of information.
Indexing
In order to let users to find what they are interested in effectively, indexing is very important. To achieve this, there are several steps. The first process is identifying fields such as Title and Author. Once it has been done, users can narrow their search according to their criteria. The second step is identifying words. In this step, words are split up by spaces. The third step is stemming in which suffixes are eliminated. For example cook, cooking, cooked and cooks are combined to cook. In addition to this, extra step is synonym which is a term with almost same meaning as another term.
Search Models
Users can achieve documents they are interested in by using search engine which is based on a retrieval mechanism. Boolean is a dominant logic behind the retrieval mechanism. The combination of three operators which are AND, OR, and NOT allows users to narrow the search get relevant resources effectively. The following are examples
- swimming AND running: result includes documents contain both swimming and running.
- swimming OR running: result includes documents contain both swimming and runninge or one of them.
- swimming NOT running: result includes documents contain swimming and not contain running.
No comments:
Post a Comment