Continuously validate LLM-based applications including LLM hallucinations, performance metrics, and potential pitfalls throughout the entire lifecycle from pre-deployment and internal experimentation to production.π
Thanks @kevin for hunting our LLM Evaluation solution π
π Hey, ProductHunt community
I am Shir, co-founder and CTO of Deepchecks. At Deepchecks, weβve built a pretty special solution for LLM Evaluation and are thrilled to launch it today on ProductHunt!
When we launched our open-source testing package last year, we quickly received an overwhelming response with over 3K stars π and more than 900K downloads! After the launch of our NLP package in June, we noticed that an incredible amount of the feedback calls we were having about the NLP package were asking for help with evaluating LLM-based apps. π€―
After creating an initial POC and getting feedback from various companies, we gained the confidence we needed to dive deeply into the LLM Evaluation space. And yes, turns out itβs a pretty big deal.
π As we began working on the LLM Evaluation module, weβve arrived at some important learnings that teams are struggling to figure out answers to these questions while deploying their LLM apps:
- Is it good? π (accuracy, relevance, usefulness, grounded in context, etc.)
- Is it not problematic? π (bias, toxicity, PII leakage, straying from company policy, etc.)
- Evaluating and comparing versions (that differ in their prompts, basemodels, or any other change in the pipeline)
- Efficiently building a process for automatically estimating the quality of the LLM interactions and annotating them
- Deployment lifecycle management from experimentations/development, staging/beta testing, to production.
Deepchecks LLM Evaluation solution helps with-
β Simply and clearly assess "How good is your LLM application?"
π Track and compare different combinations of prompts, models, and code.
π Gain direct visibility into the functioning of your LLM-based application.
β οΈ Reduce the risk during the deployment of LLM-based applications.
π Simplify compliance with AI-related policies and regulations.
We're also hosting a launch event today at 8.30 AM PST today, feel free to sign up to interact with the Deepchecks team and see a live demo: https://www.linkedin.com/events/...
Apply for Deepcheks LLM evaluation access:
https://deepchecks.com/solutions...
π Would appreciate any questions, and hope to see you there!
Congratulations ! π Your journey from the overwhelming success of your open-source testing package to this latest venture is truly inspiring.
This is amazing product. Great stuff, we are using deepchecks for our internal LLM evaluation, requires couple of minutes to get big insights! I really like it.
About Deepchecks LLM Evaluation on Product Hunt
βValidate, monitor, and safeguard LLM-based appsβ
Deepchecks LLM Evaluation launched on Product Hunt on November 28th, 2023 and earned 196 upvotes and 92 comments, placing #7 on the daily leaderboard. Continuously validate LLM-based applications including LLM hallucinations, performance metrics, and potential pitfalls throughout the entire lifecycle from pre-deployment and internal experimentation to production.π
Deepchecks LLM Evaluation was featured in Developer Tools (512.7k followers) and Artificial Intelligence (469k followers) on Product Hunt. Together, these topics include over 163.4k products, making this a competitive space to launch in.
Who hunted Deepchecks LLM Evaluation?
Deepchecks LLM Evaluation was hunted by Kevin William David. A βhunterβ on Product Hunt is the community member who submits a product to the platform β uploading the images, the link, and tagging the makers behind it. Hunters typically write the first comment explaining why a product is worth attention, and their followers are notified the moment they post. Around 79% of featured launches on Product Hunt are self-hunted by their makers, but a well-known hunter still acts as a signal of quality to the rest of the community. See the full all-time top hunters leaderboard to discover who is shaping the Product Hunt ecosystem.
Reviews
Deepchecks LLM Evaluation has received 6 reviews on Product Hunt with an average rating of 5.00/5. Read all reviews on Product Hunt.
Want to see how Deepchecks LLM Evaluation stacked up against nearby launches in real time? Check out the live launch dashboard for upvote speed charts, proximity comparisons, and more analytics.