New Show Hacker News story: Show HN: Data Bonsai: a Python package to clean your data with LLMs

2 by alvin_r_h | 0 comments on Hacker News.
I've been doing some data cleaning with LLMs for my fine-tuning projects, and decided to build a package for it as a side project. Check it out here: https://ift.tt/Yp3BSCD

Some features:
- categorization (labelling), transformation, and decomposition (text into a structured format)
- validates LLM outputs
- batch mode batches up the inputs/outputs so you don't send the prompt (schema, few-shot examples) for every row of data, saving a significant amount of tokens

There are some similarities to the Instructor repo, but this is simpler and made for datasets. Would love any feedback/suggestions (and a star if you like it!)
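To make the batching and validation ideas concrete, here is a minimal sketch in plain Python. It is not Data Bonsai's actual API: the function names, the `ALLOWED_LABELS` set, the prompt wording, and the `call_llm` callable (any function that takes a prompt string and returns the model's reply as a string) are all assumptions made for illustration.

```python
import json
from typing import Callable

# Hypothetical label set, used only for this illustration.
ALLOWED_LABELS = {"positive", "negative", "neutral"}


def build_batch_prompt(instructions: str, rows: list[str]) -> str:
    """State the shared instructions once, then list every row with an index."""
    numbered = "\n".join(f"{i}: {row}" for i, row in enumerate(rows))
    return (
        f"{instructions}\n\n"
        "Return a JSON object mapping each row number to a label.\n\n"
        f"Rows:\n{numbered}"
    )


def validate_batch_output(raw: str, n_rows: int) -> list[str]:
    """Check that the reply is a JSON object with an allowed label for every row."""
    parsed = json.loads(raw)
    if not isinstance(parsed, dict):
        raise ValueError("expected a JSON object keyed by row number")
    labels = []
    for i in range(n_rows):
        label = parsed.get(str(i))
        if label not in ALLOWED_LABELS:
            raise ValueError(f"row {i}: invalid or missing label {label!r}")
        labels.append(label)
    return labels


def label_in_batches(
    rows: list[str],
    call_llm: Callable[[str], str],  # assumed: prompt in, raw reply out
    instructions: str,
    batch_size: int = 20,
) -> list[str]:
    """Send one prompt per batch instead of one prompt per row."""
    results: list[str] = []
    for start in range(0, len(rows), batch_size):
        batch = rows[start:start + batch_size]
        raw = call_llm(build_batch_prompt(instructions, batch))
        results.extend(validate_batch_output(raw, len(batch)))
    return results
```

With a batch size of 20, the instructions (and any schema or few-shot examples folded into them) are sent once per 20 rows rather than once per row, which is where the token savings come from. In practice a failed validation would likely trigger a retry for that batch; this sketch simply raises.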
