Mini Search Engine (Study Case : News Question About COVID-19)
Steps :
- Download Dataset (Crawling)
- Preprocessing Dataset
- Case Folding
- Tokenizing
- Stemming
- Stop Words
- Re Join Words
- Indexing / Pembobotan TF-IDF
- TF - IDF (Sklearn)
- Retrieval / Cosine Similarity
- Cosine Similarity (Sklearn)
- Perankingan / TOP 10
- Sorting from the value closest to the similarity value