Stars
starfish: unified pipelines for image-based transcriptomics
Spatialproteomics is a lightweight wrapper around xarray, intended to facilitate the exploration and analysis of highly multiplexed immunohistochemistry data. Docs available here: ht…
A relabeling tool for tiled segmentation with Dask
https://sparrow-pipeline.readthedocs.io/en/latest/
An open and interoperable data framework for spatial omics data
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
napari plugin to deal with charging artifacts in tomography electron microscopy data
Power CLI and Workflow manager for LLMs (core package)
A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
Access a database of word frequencies, in various natural languages.
Machine learning model server that can predict AND train
INCEpTION provides a semantic annotation platform offering intelligent annotation assistance and knowledge management.
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
A tool for extracting plain text from Wikipedia dumps
Python library & examples for Masked Language Model Scoring (ACL 2020)
The Spanish Fake News Corpus contains a collection of 971 news items, divided into 491 real and 480 fake. The corpus covers news from 9 different topics: Science, Sport, Economy, Education, Ente…
Faker is a Python package that generates fake data for you.
Enhancing BERT training with semi-supervised Generative Adversarial Networks in PyTorch/HuggingFace
An NLP system for generating reading comprehension questions
A PyTorch implementation of Context Vector Data Description (CVDD), a method for Anomaly Detection on text.
Text anomaly detection with ARAE and AnoGAN in TensorFlow 2.0
Code for the paper "Adversarially Regularized Autoencoders (ICML 2018)" by Zhao, Kim, Zhang, Rush and LeCun
📝 Python package to calculate readability statistics of a text object: paragraphs, sentences, articles.
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (KDD'21).
BERT-, LDA-, and TF-IDF-based keyword extraction in Python