finetuning-llms

Medical Language Model fine-tuned using pretraining, instruction tuning, and Direct Preference Optimization (DPO). Progresses from general medical knowledge to specific instruction following, with experiments in preference alignment for improved medical text generation and understanding.

llm llm-training finetuning-llms

Updated Oct 4, 2024
Jupyter Notebook

garyfanhku / Galore-pytorch

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

pytorch lora low-rank-approximation peft large-language-models llm low-rank-adaptation finetuning-llms galore

Updated Mar 7, 2024
Python

zhaoyl18 / SEIKO

SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all baselines (PPO, classifier-based guidance, direct reward backpropagation) for fine-tuning Stable Diffusion.

reinforcement text-to-image online-learning finetuning diffusion-models stable-diffusion finetuning-llms

Updated Jul 18, 2024
Python

BaohaoLiao / ApiQ

[EMNLP 2024] Quantize LLMs to extremely low bit-widths, then fine-tune the quantized models

quantization peft llm finetuning-llms

Updated Jul 18, 2024
Python

bhattbhavesh91 / google-gemma-finetuning-n2sql

Finetuning Google's Gemma Model for Translating Natural Language into SQL

google lora gemma natural-language-to-sql fine-tuning finetuning supervised-finetuning finetuning-llms

Updated Feb 22, 2024
Jupyter Notebook

adithya-s-k / Indic-llm

An open-source framework designed to adapt pre-trained large language models (LLMs), such as Llama, Mistral, and Mixtral, to a wide array of domains and languages.

lora finetuning dpo llm finetuning-llms continual-pre-training

Updated May 27, 2024
Python

SaltyGod / Qwen-Qlora-ACSA

Sentiment analysis with Qwen-1.5-1.8B, using prompt optimization and QLoRA fine-tuning

sentiment-analysis torch prompt-engineering qlora finetuning-llms qwen xtuner

Updated May 7, 2024
Python

louisc-s / QLoRA-Fine-tuning-for-Film-Character-Styled-Responses-from-LLM

Code for fine-tuning the Llama 2 LLM on a custom text dataset to produce responses in the style of film characters

deep-learning chatbot lora peft parameter-efficient-tuning generative-ai qlora finetuning-llms llama2

Updated Jan 3, 2024
Python

zelaki / awesome-LoRA

A curated list of parameter-efficient fine-tuning (PEFT) papers, each with a TL;DR

lora diffusion peft low-rank-adaptation finetuning-llms

Updated Sep 19, 2024

ShashankGupta10 / Code-Wizard

Code Wizard is a coding companion and code generation tool powered by CodeLLama-v2-34B that automatically generates and enhances code based on best practices found in your GitHub repository.

langchain-python finetuning-llms llama2

Updated Mar 29, 2024
Python

PromptEngineer48 / Fine_tuning_1

Finetuning LLMs + Private Data (Video 1/10) Basic

finetuning-llms

Updated Nov 11, 2023
Jupyter Notebook

Rahul-AkaVector / java-code-generator

This repository contains code for fine-tuning the Llama 3 8B model with Alpaca-style prompts to generate Java code. The code is based on a Google Colab notebook.

finetune finetuning java-code-generator javacode finetuning-llms finetuning-large-language-models llama3 llama3-finetune

Updated Jun 19, 2024
Jupyter Notebook

Here are 73 public repositories matching this topic...

adithya-s-k / AI-Engineering.academy

GURPREETKAURJETHRA / END-TO-END-GENERATIVE-AI-PROJECTS

Itachi-Uchiha581 / Auto-Data

simplifine-llm / Simplifine

wangermeng2021 / llm-webui

neuralwork / instruct-finetune-mistral

BaohaoLiao / mefts

Prismadic / magnet

samadon1 / LLM-From-Scratch

garyfanhku / Galore-pytorch

zhaoyl18 / SEIKO

BaohaoLiao / ApiQ

bhattbhavesh91 / google-gemma-finetuning-n2sql

adithya-s-k / Indic-llm

SaltyGod / Qwen-Qlora-ACSA

louisc-s / QLoRA-Fine-tuning-for-Film-Character-Styled-Responses-from-LLM

zelaki / awesome-LoRA

ShashankGupta10 / Code-Wizard

PromptEngineer48 / Fine_tuning_1

Rahul-AkaVector / java-code-generator
