<aside> 💭 The content below is curated by the community to help beginners get started with LLM model and dataset contributions. There are many ways to contribute. If you would like to offer a useful guide, please reach out to us in #contributor-chat!

</aside>

Hey fellow data scientists, engineers, and AI enthusiasts, welcome to Virtual Protocol (VP). Today I want to invite you to become contributors on our platform. Our community has curated a number of guides to help beginners get started!

You can follow along with the guides below.

<aside> 💡 Let us know your progress in #contributor-chat!

</aside>

Are you excited to start? Read on.

Here are the guides you can refer to as you get started.

🧠 LLM Models

Beginner's Guide to Web Scraping with Python

Beginner's Guide to Finetuning a Model

🔊 Voice Models

Beginner's Guide to Contributing Audio Core in Virtual Protocol

Common questions

Where can I run my training?

We offer free credits for model finetuning! Each GPU is topped up with $50 of credit for training. Please reach out to your community lead or the Virtual Core Team for access.

How big a dataset should I prepare?

We pay a higher rate for datasets, so we expect some effort to go into curating them. You can scrape, collect, or generate a synthetic dataset, but it should be large enough for LoRA finetuning. Roughly 1k pairs is good; 100 rows is too few.
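If you want a quick sanity check before submitting, here is a minimal sketch that counts usable pairs and compares the count with the sizes mentioned above. It assumes your dataset is a JSONL file with one pair per line, and the field names `prompt` and `response` are hypothetical, so swap in whatever your dataset actually uses. The "borderline" label for anything between 100 and 1k pairs is our own assumption, not an official rule.

```python
import json

GOOD_SIZE = 1000  # "~1k pairs is good" (from the answer above)
TOO_SMALL = 100   # "100 rows" is called out as too few


def count_valid_pairs(lines, prompt_key="prompt", response_key="response"):
    """Count JSONL lines that hold a non-empty prompt/response pair.

    The key names are hypothetical defaults; pass your own if they differ.
    """
    count = 0
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        try:
            row = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip lines that are not valid JSON
        if not isinstance(row, dict):
            continue  # skip JSON values that are not objects
        prompt = row.get(prompt_key)
        response = row.get(response_key)
        if (
            isinstance(prompt, str)
            and isinstance(response, str)
            and prompt.strip()
            and response.strip()
        ):
            count += 1
    return count


def size_verdict(n_pairs):
    """Map a pair count to a rough verdict (the middle band is an assumption)."""
    if n_pairs <= TOO_SMALL:
        return "too small"
    if n_pairs < GOOD_SIZE:
        return "borderline"
    return "good"
```

To use it, open your file and pass the file object straight in, for example `count_valid_pairs(open("my_dataset.jsonl", encoding="utf-8"))`, where `my_dataset.jsonl` is a placeholder for your own path. This only checks size and basic shape, so you still need to review the quality of the pairs yourself.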