evalutation

Here are 8 public repositories matching this topic...

zjukg / SKA-Bench

[Paper][EMNLP 2025] SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs

benchmark knowledge table knowledge-graph understanding evalutation large-language-models llm-benchmarking structured-knowledge ska-bench

Updated Aug 27, 2025
Python

amagovpt / accessmonitor-rulesets

RuleSets of AccessMonitor - the validator of web accessibility practices

rules web wcag evalutation

Updated Apr 6, 2026
TypeScript

Miccighel / NewBestSub

Efficient topic-set reduction for IR evaluation using NSGA-II

kotlin information-retrieval source-code nsga-ii jmetal evalutation newbestsub

Updated Oct 3, 2025
Kotlin

ActualLearner / evalboard

Single-turn LLM evaluation platform. Run structured evals across 5 different AI providers. Score outputs, track latency, and compare models through a live analytics dashboard.

react python django ai gemini openai cs50 evalutation cs50w groq llm anthropic litellm deepseek

Updated May 2, 2026
JavaScript

M0inUddin / LangChain-for-LLM-Application-Development

Jupyter notebooks for the DeepLearning.AI course "LangChain for LLM Application Development"

parser question-answering chains agents evalutation llm langchain

Updated Oct 17, 2023
Jupyter Notebook

5-abdulsami / mcq_scanner

An MCQ scanner app for students, built with Flutter and Flask. Students photograph a physical MCQ exam paper, and the app turns it into an interactive quiz that they solve in the app, with AI evaluating their answers.

ai localstorage exam quiz flutter quizapp summarizer mcq textrecognition flutter-apps gemini-api evalutation examapp

Updated May 16, 2026
Dart

ManueleVeggi / mytisse

Repository for the final thesis in "Interaction Media Design" (Prof. Sofia Pescarin) at University of Bologna, MA "Digital Humanities and Digital Knowledge" (a.y. 2022/2023)

interaction-design digital-humanities ux-experience museum-experience ux-research evalutation emotion-design

Updated Jan 7, 2025
Jupyter Notebook

victord03 / ds_armor_comparison

Compare armor sets from the video game Dark Souls across most categories, using the attributes defined by the game's developers.

comparison evalutation

Updated Nov 4, 2025
Python
