You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A specialized toolkit for Information Retrieval and Web Analytics. This rep covers the architecture of search engines, featuring custom implementations of inverted and positional indexing, Boolean retrieval, and text preprocessing pipelines. It includes N-grams analysis, cosine similarity foundations, and advanced NLP tokenization techniques.
An information retrieval system for boolean queries, proximity quries and wildcard queries using Inverted indexing, Biword indexing, positional indexing and soundex indexing.
This is a basic search engine written in Python for ranking documents based on the proximity of query terms, featuring an indexing and ranked retrieval system for textual data.