Stars
vnStat - a network traffic monitor for Linux and BSD
Nezha server over Argo tunnel 使用 Argo 隧道的哪吒服务端
The best Java open source crypto currency exchange platform, bitcoin exchange based on Java | BTC exchange | ETH exchange | digital currency exchange | trading platform | matching trading engine. T…
Open-Source Cloud-Native Digital Asset & Cryptocurrency Exchange Platform
Countries, Languages & Continents data (capital and currency, native name, calling codes).
Node.js module for i18n apps - query any country's spoken languages or find countries where a language is spoken.
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Web Extension for saving a faithful copy of a complete web page in a single HTML file
Gatsby Remark plugin to embed well known services by their URL.
🎭 Playwright integration for Scrapy
Headless chrome/chromium automation library (unofficial port of puppeteer)
To extract main article from given URL with Node.js
今日头条新闻详情页面爬取,逆向 Cookies 中 __ac_signature 生成过程
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
JavaScript domain name parser based on the Public Suffix List
Python version of the Playwright testing and automation library.
This repository provides very basic flask, streamlit, and docker examples for the llama_index (fka gpt_index) package
DocsGPT is an open-source genAI tool that helps users get reliable answers from knowledge source, while avoiding hallucinations. It enables private and reliable information retrieval, with tooling …
Scrape websites for text by CSS selector.
A little repo that help you craw all subtitle of a anime seasion on Bilibili and convert it to SRT (SubRip file format) format
Article extraction benchmark: dataset and evaluation scripts
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
Automatically check for GDPR/CCPA consent by running a Playwright headless browser to check for marketing and analytics scripts firing before and after consent.
A standalone version of the readability lib