HomePage |
arXiv |
Demo |
Citation
This repository is the official implementation of FoodSky.
FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination
Pengfei Zhou, Weiqing Min*, Chaoran Fu, Ying Jin, Mingyu Huang, Xiangyang Li, Shuhuan Mei, and Shuqiang Jiang*
* Correponding authors.
FoodSky is a foundamental LLM specifically designed for the world of food, marking a significant step forward in food computing. As the first Chinese LLM built just for food-related topics, FoodSky uses a vast collection of food data called FoodEarth, which includes everything from recipes to nutritional information, sourced from trusted books and websites. This model isn't just about handling huge amounts of data—it’s smart in understanding and discussing food like a professional. Whether it's passing chefs' exams or providing diet advice, FoodSky shows impressive skills, making it a reliable resource for anyone in the food industry, from chefs to dietitians. Its aim is to make working with food more creative and healthy, helping users across various scenarios with accurate and culturally aware advice. This tool is set to change how we interact with food information, blending detailed food knowledge with cutting-edge technology to better serve food enthusiasts and professionals alike.
Please refer to Quick Start Guide for detailed instructions on evaluation, training, and other resources.
- NLP Metrics: For BLEU, GLEU and ROUGE evaluation, check the
inference_nlp
directory. Example usage ininference_nlp/all_test.sh
. - CDE Testing: Code execution evaluation is available in
test/cde-eval/test.sh
. - GPT Evaluations: GPT model evaluation examples can be found in
test/gpt-eval
.
Refer to train/train_scripts/README.md
for detailed training instructions covering:
- Resource optimization (Full-tuning, LoRA, QLoRA)
- Advanced algorithms (GaLore, BAdam, Adam-mini, DoRA, LongLoRA)
- Practical tricks (FlashAttention-2, Unsloth, Liger Kernel, RoPE scaling, NEFTune, rsLoRA)
- Experiment monitoring tools
- Release the test data and test code.
- Release the inference code.
- Release the training code and models.
- Release a version of FoodEarth dataset.
- Release a bilingual version of FoodSky and FoodEarth.
We provide the FoodSky models in Zenodo. To prevent server overload, our demo system requires authentication. If you are interested in accessing our work, please contact us through the following channels to request credentials:
- 📧 Email: zpf4wp@outlook.com
- 🌐 Homepage: http://123.57.42.89/FoodComputing__Home.html
If you feel FoodSky insteresting, feel free to use the following BibTeX entry to cite our paper. Thanks!
@misc{zhou2024foodskyfoodorientedlargelanguage,
title={FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination},
author={Pengfei Zhou and Weiqing Min and Chaoran Fu and Ying Jin and Mingyu Huang and Xiangyang Li and Shuhuan Mei and Shuqiang Jiang},
year={2024},
eprint={2406.10261},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2406.10261},
}