Skip to content
View Hironsan's full-sized avatar
💤
Zzz
💤
Zzz

Organizations

@arXivTimes @chakki-works @doccano

Block or report Hironsan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured data, assuming a real-world scenario. The sample aims to be …

Jupyter Notebook 32 9 Updated Nov 3, 2024

This lab is a 1-day/2-day end-to-end SLM workshop led and developed by AI GBB. Attendees will learn how to quickly and easily perform the data preparation-fine tuning-serving-LLMOps series of proce…

Jupyter Notebook 24 11 Updated Nov 3, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 5,267 358 Updated Oct 24, 2024

Omnivore is a complete, open source read-it-later solution for people who like reading.

TypeScript 13,513 842 Updated Nov 3, 2024

One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure

Python 1,821 295 Updated Nov 1, 2024

Benchmarking PDF libraries

Python 218 11 Updated Oct 31, 2023

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 860 35 Updated Oct 31, 2024

Tools for merging pretrained large language models.

Python 4,763 434 Updated Oct 31, 2024

The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)

Python 32 9 Updated Oct 31, 2024

Convert PDF to markdown quickly with high accuracy

Python 17,443 1,000 Updated Oct 31, 2024

SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.

Python 57 1 Updated May 3, 2024

A pytorch quantization backend for optimum

Python 816 61 Updated Oct 29, 2024

Train Models Contrastively in Pytorch

Python 538 39 Updated Oct 23, 2024

ReFT: Representation Finetuning for Language Models

Python 1,143 100 Updated Oct 22, 2024

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

Python 8,284 1,402 Updated Oct 30, 2024

Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024

C++ 54 2 Updated Oct 1, 2024

Interactively explore unstructured datasets from your dataframe.

TypeScript 1,118 83 Updated Aug 5, 2024

https://graphacademy.neo4j.com/courses/llm-fundamentals/

Jupyter Notebook 60 22 Updated Jan 9, 2024

Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?

HTML 510 88 Updated Oct 25, 2024

The all-in-one solution for RAG. Build, scale, and deploy state of the art Retrieval-Augmented Generation applications

Python 3,513 263 Updated Nov 2, 2024
Jupyter Notebook 168 9 Updated Oct 9, 2024

minimal pytorch implementation of bm25 (with sparse tensors)

Python 88 4 Updated Mar 2, 2024

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,637 512 Updated Oct 18, 2024

A framework for prompt tuning using Intent-based Prompt Calibration

Python 2,152 184 Updated Oct 17, 2024

Open-source AI cookbook

Jupyter Notebook 1,661 236 Updated Oct 28, 2024

Generative Representational Instruction Tuning

Jupyter Notebook 560 39 Updated Sep 22, 2024

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 13,626 855 Updated Oct 30, 2024

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 36,479 3,565 Updated Nov 1, 2024

Educational materials on deep learning by Weights & Biases

Jupyter Notebook 548 256 Updated Nov 1, 2024

Superfast AI decision making and intelligent processing of multi-modal data.

Python 2,064 213 Updated Nov 2, 2024
Next