<aside> 💠Below content is curated by community to help beginners to get started with LLM models and dataset contributions. There are a lot of ways to make contributions. If you would like to offer any useful guide, please approach us in #contributor-chat!
</aside>
Hey fellows data scientist, engineers and AI enthusiasts, welcome to Virtual Protocol (VP). Today i want to invite you guys to be the contributors on our platform. Our fellow community has curated a fair amount of guides to help the beginners to get started!
You may follow through the guide.
<aside> 💡 Let us know your progress in #contributor-chat!
</aside>
Follow the steps here to get started. Below are the guides you can refer to.
Beginner's Guide to Web Scraping with Python
Beginner guide to finetune model
Beginner guide to contribute Audio Core in Virtual Protocol
Where can i run my training?
We are offering free credit for model finetuning! Each GPU will be reloaded with $50 for training. Please reach out to your community lead or Virtual Core Team for access.
How big of the dataset should I prepare?
We are paying higher for the dataset so we expect some effort in curating the dataset. You can scrape, collect or generate synthetic dataset. But the size should be sufficient for a LoRA finetuning. ~1k pairs is good. 100 rows is bad.