Language Model Training

Distill knowledge from large models into smaller, faster,
fully private pipelines tailored to your use case that you
can run cheaply and efficiently in-house.

prodigy train ./information_extraction --ner news_ner --textcat news_textcat

=========== Training pipeline ===========
48% | ████████████████

Build better, faster and fully private pipelines

Prodigy makes expert workflows and the latest best practices available to everyone. Build transparent AI systems by distilling domain-specific knowledge from larger models and human experts into fully private pipelines that you can run cheaply and efficiently in-house.

================= Training pipeline =================
Pipeline: ['transformer', 'ner', 'textcat']

#     ENTS_F  ENTS_P  ENTS_R  CATS_SCORE  SCORE
----  ------  ------  ------  ----------  ------
   0    0.06    0.03    0.17       46.23    0.23
 200   25.02   27.90   22.68       45.34    0.35
 400   86.10   87.65   84.60       72.06    0.79
 600   87.98   86.91   89.07       74.66    0.81
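To help read the training log above: ENTS_P, ENTS_R and ENTS_F are the precision, recall and F-score of the named entity recognizer, and the F-score is the harmonic mean of the other two. The following minimal sketch recomputes ENTS_F from the logged precision and recall. The helper names `f_score` and `combined_score` are hypothetical, chosen for illustration only. The equal weighting used for SCORE is an assumption that happens to match the logged rows; the actual weights depend on the training configuration. The step-0 row is left out because its two-decimal values are too coarse to recompute reliably.

```python
def f_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, on the same scale as the inputs."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def combined_score(ents_f: float, cats_score: float) -> float:
    """Assumption: SCORE is an equal-weight average of ENTS_F and CATS_SCORE,
    rescaled from percentages to the 0-1 range. The real weights are set in
    the training config and may differ."""
    return (ents_f + cats_score) / 2 / 100


# (step, ENTS_F, ENTS_P, ENTS_R, CATS_SCORE, SCORE) copied from the log above.
ROWS = [
    (200, 25.02, 27.90, 22.68, 45.34, 0.35),
    (400, 86.10, 87.65, 84.60, 72.06, 0.79),
    (600, 87.98, 86.91, 89.07, 74.66, 0.81),
]

for step, ents_f, ents_p, ents_r, cats_score, score in ROWS:
    recomputed_f = round(f_score(ents_p, ents_r), 2)
    recomputed_score = round(combined_score(ents_f, cats_score), 2)
    print(f"step {step}: ENTS_F logged={ents_f} recomputed={recomputed_f}, "
          f"SCORE logged={score} recomputed={recomputed_score}")
```

Recomputing the columns this way is a quick sanity check when comparing runs: a high ENTS_F requires both precision and recall to be high, so a pipeline that only improves one of them will not move the combined score much.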

Take back control

Prodigy runs entirely under your control, making it suitable for even the strictest privacy requirements. You can download it and run it locally right out of the box, or adapt it to serve your infrastructure needs. The models you produce are yours as well, with absolutely no lock-in.

Documentation

Overview
  • Downloadable developer tool and library
  • Create, review and train from your annotations
  • Runs entirely on your own machines
  • Powerful built-in workflows

Pricing

Overview
  • Lifetime license, pay once, use forever
  • Flexible options for individuals and teams
  • Full privacy, no data leaves your servers
  • Download and install like any other library