Skip to content

LanceZPF/FoodSky

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination

HomePage | arXiv | Demo | Citation

This repository is the official implementation of FoodSky.

FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination
Pengfei Zhou, Weiqing Min*, Chaoran Fu, Ying Jin, Mingyu Huang, Xiangyang Li, Shuhuan Mei, and Shuqiang Jiang*

* Correponding authors.

Introduction

FoodSky is a foundamental LLM specifically designed for the world of food, marking a significant step forward in food computing. As the first Chinese LLM built just for food-related topics, FoodSky uses a vast collection of food data called FoodEarth, which includes everything from recipes to nutritional information, sourced from trusted books and websites. This model isn't just about handling huge amounts of data—it’s smart in understanding and discussing food like a professional. Whether it's passing chefs' exams or providing diet advice, FoodSky shows impressive skills, making it a reliable resource for anyone in the food industry, from chefs to dietitians. Its aim is to make working with food more creative and healthy, helping users across various scenarios with accurate and culturally aware advice. This tool is set to change how we interact with food information, blending detailed food knowledge with cutting-edge technology to better serve food enthusiasts and professionals alike. overview

🚀 Quick Start Guide

Please refer to Quick Start Guide for detailed instructions on evaluation, training, and other resources.

Evaluation

  • NLP Metrics: For BLEU, GLEU and ROUGE evaluation, check the inference_nlp directory. Example usage in inference_nlp/all_test.sh.
  • CDE Testing: Code execution evaluation is available in test/cde-eval/test.sh.
  • GPT Evaluations: GPT model evaluation examples can be found in test/gpt-eval.

Training

Refer to train/train_scripts/README.md for detailed training instructions covering:

  • Resource optimization (Full-tuning, LoRA, QLoRA)
  • Advanced algorithms (GaLore, BAdam, Adam-mini, DoRA, LongLoRA)
  • Practical tricks (FlashAttention-2, Unsloth, Liger Kernel, RoPE scaling, NEFTune, rsLoRA)
  • Experiment monitoring tools

🎯 TODO List

  • Release the test data and test code.
  • Release the inference code.
  • Release the training code and models.
  • Release a version of FoodEarth dataset.
  • Release a bilingual version of FoodSky and FoodEarth.

📞 Contact

We provide the FoodSky models in Zenodo. To prevent server overload, our demo system requires authentication. If you are interested in accessing our work, please contact us through the following channels to request credentials:

🖊️ Citation

If you feel FoodSky insteresting, feel free to use the following BibTeX entry to cite our paper. Thanks!

@misc{zhou2024foodskyfoodorientedlargelanguage,
      title={FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination}, 
      author={Pengfei Zhou and Weiqing Min and Chaoran Fu and Ying Jin and Mingyu Huang and Xiangyang Li and Shuhuan Mei and Shuqiang Jiang},
      year={2024},
      eprint={2406.10261},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2406.10261}, 
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published