AI검증 모음

여기는 AI검증에 대한 내용을 모아놓은 글입니다.

DeepEval - The Open-Source LLM Evaluation Framework

Prompt and Model Discovery Gain insights to quickly iterate towards optimal prompts and model

docs.confident-ai.com

LLM평가에 대한 오픈소스 프레임워크입니다.

LLM에 대한 단위 테스트 케이스 작성등을 할 수 있습니다.

GitHub - explodinggradients/ragas: Supercharge Your LLM Application Evaluations 🚀

Supercharge Your LLM Application Evaluations 🚀. Contribute to explodinggradients/ragas development by creating an account on GitHub.

github.com

LLM 애플리케이션을 평가하는 툴입니다.

AI로 생성한 이미지는 어떻게 평가할까요? (기본편)

들어가며 최근 몇 년간 생성 모델은 인공 지능 분야에서 혁신적인 도구로 부상하며 연구자와 산업 리더들의 큰 관심을 받고 있습니다. 생성 모델은 딥러닝 기술의 발전을 바탕으로 고품질...

techblog.lycorp.co.jp

라인에서 AI로 생성한 이미지에 대한 평가방법 정리한 내용입니다.

GitHub - langchain-ai/openevals: Readymade evaluators for your LLM apps

Readymade evaluators for your LLM apps. Contribute to langchain-ai/openevals development by creating an account on GitHub.

github.com

RAGAS관련 내용 모음 (1)	2025.04.19
AI관련 활용사례모음 (0)	2025.04.11

QA의 테스트 이야기