
Using Donut-LLM-Tools to create a model

In today's post, we will look at how to create an LLM using Donut-LLM-Tools.

The first thing we need in order to train a model is a valid dataset containing all the required training data.

To get a dataset, you can either use Hugging Face or use Donut-LLM-Tools to create one by scraping Wikipedia.



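from donutllmtools import Tools

# Scrape Wikipedia and write the result to a DoDS (Donut DataSet) file
Tools.DatasetCreator()

# Open the interactive menu for creating or loading a model
Tools.LLMCreator()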

The code above imports the Tools class from the Donut-LLM-Tools module. This class has two functions, DatasetCreator() and LLMCreator(). Once called, DatasetCreator() automatically starts scraping Wikipedia and writes the result to a DoDS (Donut DataSet) file. LLMCreator() is menu driven: once called, it presents the user with three options: 1. Create a model, 2. Load a model and ask it prompts, 3. Exit.

When the user presses 1, the menu asks for the dataset directory and filename, as well as the number of iterations to train the model for. When the user presses 2, the menu asks for the model's directory and filename, then asks the user to enter a prompt, and keeps looping until the user types 'exit' or 'quit'.
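To make the menu flow described above easier to picture, here is a minimal sketch of how such a menu loop could be written. This is not the actual Donut-LLM-Tools code: the names run_menu and scripted are hypothetical, the real training and loading steps are replaced by a simple log of what was requested, and returning to the main menu after 'exit' or 'quit' (as well as matching those words case-insensitively) is my assumption.

```python
from typing import Callable, List, Tuple


def run_menu(read: Callable[[str], str], write: Callable[[str], None]) -> List[Tuple]:
    """Hypothetical sketch of the menu flow; NOT the library's implementation.

    Instead of training or loading anything, it records what the user
    asked for and returns that record when the user picks Exit.
    """
    actions: List[Tuple] = []
    while True:
        write("1. Create a model\n2. Load a model and ask prompts\n3. Exit")
        choice = read("Choice: ").strip()
        if choice == "1":
            dataset = read("Dataset directory and filename: ")
            iterations = int(read("Number of training iterations: "))
            actions.append(("create", dataset, iterations))
        elif choice == "2":
            model = read("Model directory and filename: ")
            actions.append(("load", model))
            # Keep asking for prompts until the user types 'exit' or 'quit'.
            while True:
                prompt = read("Prompt: ")
                if prompt.strip().lower() in ("exit", "quit"):
                    break  # assumption: go back to the main menu
                actions.append(("prompt", prompt))
        elif choice == "3":
            return actions
        else:
            write("Please enter 1, 2 or 3.")


def scripted(answers: List[str]) -> Callable[[str], str]:
    """Return a stand-in for input() that replays a fixed list of answers."""
    remaining = iter(answers)
    return lambda _prompt: next(remaining)
```

Calling run_menu(input, print) runs the sketch interactively in a terminal, while scripted([...]) lets you replay a fixed sequence of answers without typing them.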

Hooray! You have now learnt how to use Donut-LLM-Tools to create your own datasets and models and run them locally.

