8000 GitHub - adalke/FPSim2: Simple package for fast molecular similarity searches
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
/ FPSim2 Public
forked from chembl/FPSim2

Simple package for fast molecular similarity searches

License

Notifications You must be signed in to change notification settings

adalke/FPSim2

 
 

Repository files navigation

Build Status License: MIT Anaconda-Server Badge Binder DOI

FPSim2: Simple package for fast molecular similarity searches

FPSim2 is a small NumPy centric Python/C++ package to run fast compound similarity searches. FPSim2 performs better with high search thresholds (>=0.7). Currently used in the ChEMBL interface.

Highlights:

  • Using a fast population count algorithm from libpopcnt libpopcnt
  • Bounds for sublinear speedups from 10.1021/ci600358f
  • A compressed file format with optimised read speed based in PyTables and BLOSC
  • Fast multicore CPU and GPU similarity searches
  • In memory and on disk search modes
  • Distance matrix calculation

Installation (it requires RDKit to work)

From source:

  • clone the repo
  • pip install FPSim2/

From a conda environment:

conda install -c efelix fpsim2

Documentation

Documentation is available at https://chembl.github.io/FPSim2/

Trying it online

To try out FPSim2 interactively in your web browser, just click on the binder Binder

About

Simple package for fast molecular similarity searches

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 80.1%
  • C++ 18.7%
  • Other 1.2%
0