Deep Eval Framework Using Python

adewale/skill-eval-harness

Skill Eval Harness is a Python CLI for testing whether an Agent Skill changes observable output. It reads evals/shared-benchmark.json, emits answer-key-safe task rows, grades files under eval-runs/, ...

IEEE

A dual-stage reconstruction and optimization deep learning framework for generating high-precision seamless precipitable water vapor across the mainland United States

Abstract: Precipitable water vapor (PWV) is critical to global climate dynamics, the terrestrial water cycle, and extreme weather events. However, current Moderate Resolution Imaging Spectroradiometer ...

IEEE

A Flexible Multi-Agent Deep Reinforcement Learning Framework for Dynamic Routing and Scheduling of Latency-Critical Services

Abstract: Timely delivery of delay-sensitive information over dynamic, heterogeneous networks is increasingly essential for a range of interactive applications, such as industrial automation, ...

GitHub

eval-framework.md

Review Eval Framework Problem Agent Validator supports multiple code review adapters (Claude Code, Codex CLI, GitHub Copilot CLI), each configurable with different models, aliases, and thinking/effort ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results