8000 mdic's list / 🌽 corpy · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View mdic's full-sized avatar
🐜
🐜

Organizations

@archiviofontiorali

Block or report mdic

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

🌽 corpy

6 repositories

search engine optimizationA complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance base…

Python 2 Updated Dec 24, 2022

A basic search engine to index a corpus for searching and rank the document data set.

Python 3 Updated Mar 14, 2023

Various Indexing and Query Based Retrieval Models and Page-rank Algorithm in Python 3.0

Python 3 3 Updated Apr 23, 2024

Search Engine built using Flask, HTML, CSS and MongoDB using an inverted index (TF-IDF scoring).

Python 4 1 Updated Dec 15, 2023

Built a search engine from scratch for a Wikipedia corpus of over 21 million articles (85 Gb) to give search results within 4 seconds. Parsed Wikipedia pages into tokens by applying appropriate tec…

Python 1 Updated Nov 28, 2021

Open source Python package to produce word sketches inspired by Sketch Engine (to make reproducible analyses)

GLSL 4 1 Updated Aug 30, 2024
0