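EVALUATION OF HTML DOCUMENT RETRIEVAL EFFECTIVENESS IN THE APPLICATION OF A STEMMED TERM VECTOR MODEL WITH LOGARITHMIC IN-DOCUMENT TERM FREQUENCY WEIGHTING AND THE SOKAL MEASURE (EVALUASI EFEKTIVITAS PENCARIAN DOKUMEN HTML PADA PENERAPAN STEMMED TERM VECTOR MODEL DENGAN PEMBOBOTAN LOGARITMA FREKUENSI TERM DALAM DOKUMEN DAN PENGUKURAN SOKAL)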

Angga Kusuma Nugraha; Yesi Puspita Dewi

doi:10.36080/bit.v15i1.674


Abstract

The purpose of this study was to evaluate the effectiveness of an information retrieval system that uses the Sokal/Sneath equation and the vector method, by comparing one system model with another. The collection consists of 100 documents taken from the internet, covering 10 different topics. The algorithms used include extraction, tokenization, stopword removal, stemming, term weighting, and similarity calculation. An information retrieval system is a system that automatically searches for information relevant to the user's needs. The relevance of documents to a query is measured by calculating average precision and recall, and these calculations are done manually. The average precision and recall values yield a set of effectiveness points, which are presented in graphical form at the end of the study. This graph is then combined with precision and recall graphs from other related studies, for comparison in drawing the final conclusions.
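The processing steps named in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the formulas, so everything here is an assumption rather than the authors' implementation: the logarithmic weight is taken as 1 + ln(tf), the Sokal/Sneath measure is written in its common vector generalisation x·y / (2|x|² + 2|y|² − 3 x·y), the stopword list is a placeholder, and the stemmer is left as a pluggable function (identity by default) because the abstract does not name one. All function names are hypothetical.

```python
import math
import re
from collections import Counter

# Placeholder stopword list (assumption; the study's list is not given).
STOPWORDS = {"the", "a", "an", "of", "and", "in", "is", "to"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def preprocess(text, stem=lambda term: term):
    """Tokenize, drop stopwords, then stem each remaining term.

    `stem` defaults to the identity function; a real stemmer can be
    passed in its place.
    """
    return [stem(t) for t in tokenize(text) if t not in STOPWORDS]


def log_tf_vector(terms):
    """Weight each term as 1 + ln(tf) (assumed form of the log weighting)."""
    return {t: 1.0 + math.log(tf) for t, tf in Counter(terms).items()}


def sokal_sneath(x, y):
    """Assumed vector form of the Sokal/Sneath similarity.

    sim = x.y / (2|x|^2 + 2|y|^2 - 3 x.y), which is 1 for identical
    vectors and 0 for vectors with no shared terms.
    """
    dot = sum(w * y[t] for t, w in x.items() if t in y)
    norm_x = sum(w * w for w in x.values())
    norm_y = sum(w * w for w in y.values())
    denom = 2 * norm_x + 2 * norm_y - 3 * dot
    return dot / denom if denom > 0 else 0.0


def precision_recall(retrieved, relevant):
    """Precision and recall of a retrieved list against a relevant set."""
    hits = len(set(retrieved) & set(relevant))
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


def rank(query, docs):
    """Return document ids sorted by decreasing similarity to the query."""
    q = log_tf_vector(preprocess(query))
    scores = {
        doc_id: sokal_sneath(q, log_tf_vector(preprocess(text)))
        for doc_id, text in docs.items()
    }
    return sorted(scores, key=scores.get, reverse=True)
```

Averaging the values returned by `precision_recall` over a set of test queries gives the kind of average precision and recall points the abstract describes; in the study itself those calculations were done manually.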

Keywords

Information retrieval, Precision and recall, Similarity Calculation, Stemming, Sokal, Stopword removal, Tokenization


This work is licensed under a Creative Commons Attribution 4.0 International License.

OFFICE:

FAKULTAS TEKNOLOGI INFORMASI - UNIVERSITAS BUDI LUHUR, Jl. Ciledug Raya, Petukangan Utara, Jakarta Selatan, 12260. DKI Jakarta, Indonesia. Telp: 021-585 3753 Fax: 021-585 3752

Bit (Fakultas Teknologi Informasi Universitas Budi Luhur) by FAKULTAS TEKNOLOGI INFORMASI - UNIVERSITAS BUDI LUHUR is licensed under CC BY-SA 4.0 Creative Commons License
