autumn-ma/midori

Midori

System design and implementation for a web crawler.

System Design

High Level Design

This is the high-level flow of the data. Initially, the API calls the DB, which in turn calls the crawler, which fetches HTML pages and parses information from them. The crawler also needs URLs to fetch, and these come from our URL DB. We initially seed the URL DB with some pages, but after that it becomes a self-sustaining loop: the crawler feeds the URLs it finds while parsing web pages back into the URL DB.

Untitled-2023-03-17-1604(8)
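
The self-sustaining loop described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual implementation: `UrlDb`, `LinkParser`, `crawl`, and the injected `fetch` callable are all hypothetical names, and the URL DB is assumed to be a simple in-memory queue that skips URLs it has already seen.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkParser(HTMLParser):
    """Collects the href of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


class UrlDb:
    """Hypothetical in-memory stand-in for the URL DB (assumption)."""

    def __init__(self, seeds):
        self.pending = deque()
        self.seen = set()
        for url in seeds:
            self.add(url)

    def add(self, url):
        # Only queue URLs that have never been queued before.
        if url not in self.seen:
            self.seen.add(url)
            self.pending.append(url)

    def next_url(self):
        return self.pending.popleft() if self.pending else None


def crawl(url_db, fetch, max_pages=10):
    """Fetch pages from the URL DB and feed discovered links back into it.

    `fetch` is any callable mapping a URL to an HTML string, or None on failure.
    """
    pages = {}
    while len(pages) < max_pages:
        url = url_db.next_url()
        if url is None:
            break  # nothing left to crawl
        html = fetch(url)
        if html is None:
            continue  # skip pages that could not be fetched
        pages[url] = html
        parser = LinkParser()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links against the current page's URL.
            url_db.add(urljoin(url, href))
    return pages
```

Passing `fetch` in as a parameter keeps the sketch testable without network access; a real crawler would presumably replace it with an HTTP client and replace `UrlDb` with a persistent store.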

Low Level Design

API interface

Untitled-2023-03-17-1604(9)

Database

Schema and APIs

Untitled-2023-04-20-1131(12)

Main Table

Untitled-2023-04-20-1131(8)

Global Index

Untitled-2023-04-20-1131(9)

Text Index

Untitled-2023-04-20-1131(10)
