# Config-json Generator

This script generates a `config.json` file for machine learning models, tailored specifically for complex finetuning scripts. It automates the process by scraping relevant information from the finetuning script and lets the user customize and verify all configuration values before saving the file.
## Features

- **Script Scraping**: Automatically extracts key configuration values from your finetuning script to fill in the `config.json`.
- **Existing Config Usage**: Optionally load an existing `config.json` as a base template, pre-filling fields to minimize manual input.
- **Transformers Version Detection**: Automatically detects the installed version of the `transformers` library and fills it into the config file. If `transformers` is not installed, the script prompts the user to install it (see the sketch after this list).
- **Interactive User Prompts**: Guides the user through any missing or unclear configuration fields, ensuring that only valid values are entered.
- **Optional Fields Handling**: Identifies and excludes optional fields that are not needed, streamlining the configuration process.
- **Final Review and Modification**: After generating the `config.json`, the user can review and modify any entries before saving.
- **Existing File Handling**: If a `config.json` already exists in the directory, the user can choose to overwrite it or save the new file under a different name.
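For illustration, the detection and install prompt can be as simple as attempting an import. The following is a minimal sketch of that behavior, not the script's actual implementation; the function name `detect_transformers_version` is hypothetical:

```python
import subprocess
import sys


def detect_transformers_version() -> str:
    """Return the installed transformers version, offering to install it if missing."""
    try:
        import transformers
    except ImportError:
        answer = input("transformers is not installed. Install it now? (y/n): ").strip().lower()
        if answer != "y":
            raise SystemExit("transformers is required for version detection.")
        # Install into the environment of the current interpreter.
        subprocess.check_call([sys.executable, "-m", "pip", "install", "transformers"])
        import transformers
    return transformers.__version__
```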
## Requirements

- Python 3.x
- **Transformers Library**: The script requires the `transformers` library for version detection. If it's not installed, the script will offer to install it.
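You can also install it manually up front:

```bash
pip install transformers
```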
## Installation

Clone the repository and navigate to the directory containing the script:

```bash
git clone https://github.com/DigitEgal/Config-json_Generator
cd Config-json_Generator
```

Ensure you have the necessary Python packages installed:

```bash
pip install -r requirements.txt
```
## Usage

To generate the `config.json` file, run the script and follow the prompts:

```bash
python create_config-json.py
```
1. **Finetune Script Scraping**: The script first asks for the name of the finetuning script located in the current directory. It then scrapes the script for relevant configuration details (see the sketch after this list).
2. **Optional Existing Config**: You can provide an existing `config.json` file to use as a template. The script will pre-fill fields using this file where possible.
3. **Auto-Detected Fields**: The script automatically detects the installed version of `transformers` and other system configurations, inserting these values into the config file.
4. **Interactive Configuration**: The user is prompted to provide any missing or unclear values, with clear instructions and examples.
5. **Review and Modify**: Before saving, the script displays all the collected and auto-filled values. The user can modify any entries if needed.
6. **Saving**: The user can choose to overwrite an existing `config.json` file or save the new configuration under a different name.
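As a rough illustration of the scraping step, the snippet below pulls simple `name = value` assignments out of a finetune script with a regular expression. This is an assumption about the approach, not the script's actual implementation, and `scrape_finetune_script` is a hypothetical helper:

```python
import re

# Match simple top-level assignments such as `hidden_size = 4096` or
# `model_name = "unsloth/Meta-Llama-3.1-8B"`.
ASSIGNMENT = re.compile(r"^(\w+)\s*=\s*(.+?)\s*$", re.MULTILINE)


def scrape_finetune_script(path: str) -> dict[str, str]:
    """Collect simple assignments from a finetune script as candidate config values."""
    with open(path, encoding="utf-8") as f:
        source = f.read()
    # Strip surrounding quotes so string literals become plain values.
    return {name: value.strip("'\"") for name, value in ASSIGNMENT.findall(source)}


# Hypothetical usage: scrape_finetune_script("run_finetune.py") might return
# {"hidden_size": "4096", "model_name": "unsloth/Meta-Llama-3.1-8B"}.
```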
## Example

Here is an example session:

```text
Please enter the finetune script filename (default: run_finetune.py): run_finetune.py
Do you want to load an existing config.json for defaults? (y/n): y
Please enter the config.json filename (default: config.json): config_existing.json
_transformers_version detected: 4.43.2
hidden_size detected: 4096
Please enter the intermediate size (default: 14336): 15360
...
Config entries detected and filled:
1. _name_or_path: unsloth/Meta-Llama-3.1-8B
2. hidden_size: 4096
3. intermediate_size: 15360
...
Do you want to modify any entry? (y/n): n
config.json already exists. Do you want to overwrite it? (y/n): n
Enter a new name for the config file (e.g., config_new.json): config_new.json
Config saved to config_new.json
```
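Based on the session above, the saved file would contain entries along these lines. This is an illustrative, incomplete excerpt; the exact keys depend on your model, and the version key name assumes the usual Hugging Face `config.json` layout:

```json
{
  "_name_or_path": "unsloth/Meta-Llama-3.1-8B",
  "hidden_size": 4096,
  "intermediate_size": 15360,
  "transformers_version": "4.43.2"
}
```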
## Troubleshooting

- **Incorrect Scraping**: If the script fails to scrape values correctly from the finetuning script, you may need to enter those values manually during the interactive prompts.
- **Transformers Version**: Ensure `transformers` is installed and properly configured in your environment to avoid issues with version detection (see the check below).
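You can verify the installation from the command line:

```bash
python -c "import transformers; print(transformers.__version__)"
```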
## Disclaimer

This project is provided "as is" without any warranties, express or implied. It is not licensed for any particular use, and the creators disclaim any liability for its use. You are free to use, modify, and distribute the code, but you do so at your own risk.
## Contributing

Contributions are welcome! Please fork the repository and submit a pull request if you'd like to contribute.