Stars
This is a repository for the paper: Zilio, L., Lazzari R.R., Finatto, M.J.B. (2024) NLP for historical Portuguese: Analysing 18th-century medical texts. In Proceedings of PROPOR 2024.
A centralized location for storing curated data from cBioPortal
A Python library for evaluating the quality of synthetic medical data
The project leverages Apache Flink, Apache Kafka and Python digital Twin to provide real-time insights into healthcare data, enabling timely interventions and proactive patient care.
Generate realistic medical history for digital twins of human patients
Synthetic Patient Clinical (CCDA) Documents
A comprehensive synthetic health monitoring dataset featuring time-series health metrics for 100 patients, collected at 10-minute intervals. Ideal for healthcare-related machine learning applicatio…
A demonstration of synthetic data generation, in a healthcare example
Synthetic Population Catalyst
A lightweight package for generating visually distinct colours.
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line applica…
A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.
Applying Natural Language Processing (NLP) Tools to Assess LGBTQ+ Research Gaps in Tobacco Control Literature
Notebooks using the Hugging Face libraries 🤗
🏥 Medical Text Mining and Information Extraction with spaCy
The Complete NLP Guide: Text to Context
E3C is a freely available multilingual corpus (Italian, English, French, Spanish, and Basque) of semantically annotated clinical narratives to allow for the linguistic analysis, benchmarking, and t…
🔍 Clinical cases search by similarity specialized in Covid-19
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
NUBIA (NeUral Based Interchangeability Assessor) is a new SoTA evaluation metric for text generation
MedGraph is a project focused to construct biomedical knowledge graph. It harnesses the power of pubMed for data retrieval, spaCy for NLP, Mondo Ontology for semantic enrichment, and pywikibot for …
Clinical Natural Language Processing using spaCy, scispacy, and medspacy
A full spaCy pipeline and models for scientific/biomedical documents.