github.com/evalstate

Open Responses

Open standard extending OpenAI's Responses API. Provides a consistent, provider neutral way to interact with modern LLMs. Repairs Chat Completion API drift.

It defines a shared schema, and tooling layer that enable a unified experience for calling language models, streaming results, and composing agentic workflows—independent of provider.

Usage as a Provider / Router allows creation of rich Agent Environments

Internal Tools - (Model or Provider)

shell and local_shell
code_interpreter
apply_patch
web_search
etc..

External Tools (Client Supplied)

MCP Servers
Standard JSON function calls
Free-Form Tools
Grammar constrained Tools

Shaun Smith `@evalstate`

The evolution of Tool Calling....

Things we didn't have 18 Months ago...

MCP Streamable HTTP Transport and OAuth

AGENTS.MD and Agent Skills

Internal Tools in Inference APIs

Agent Client Protocol
Responses API

Long Running Tool Loops (and reasoning models)

Reinforcement Learning

Smaller and Simpler Harnesses

Self Directing Models

Dynamic Tool Calling

Why This Enabled Skills

Training Models

Building Kernels

`https://github.com/huggingface/upskill`

Upskill

Code Execution Tools

LLMs for Navigating: GenUI, Apps SDK (Prefect Prefab)

Closing Thoughts

Thank You!

Agent Client Protocol

Open Responses

Open standard extending OpenAI's Responses API. Provides a consistent, provider neutral way to interact with modern LLMs. Repairs Chat Completion API drift.

Usage as a Provider / Router allows creation of rich Agent Environments

It was close....! PMF for MCP

Generation and Execution Environments

Style 1 - Main Model owns Code Generation

Style 2 - Delegated Code Generation

MCP makes it easy to transfer generation and execution between models and environments!
(and who pays for inference)

Shaun Smith @evalstate

The evolution of Tool Calling....

Things we didn't have 18 Months ago...

MCP Streamable HTTP Transport and OAuth

AGENTS.MD and Agent Skills

Internal Tools in Inference APIs

Agent Client Protocol Responses API

Long Running Tool Loops (and reasoning models)

Reinforcement Learning

Smaller and Simpler Harnesses

Self Directing Models

Dynamic Tool Calling

Why This Enabled Skills

Training Models

Building Kernels

https://github.com/huggingface/upskill

Upskill

Code Execution Tools

LLMs for Navigating: GenUI, Apps SDK (Prefect Prefab)

Closing Thoughts

Thank You!

Agent Client Protocol

Open Responses

Open standard extending OpenAI's Responses API. Provides a consistent, provider neutral way to interact with modern LLMs. Repairs Chat Completion API drift.

Usage as a Provider / Router allows creation of rich Agent Environments

It was close....! PMF for MCP

Generation and Execution Environments

Style 1 - Main Model owns Code Generation

Style 2 - Delegated Code Generation

MCP makes it easy to transfer generation and execution between models and environments! (and who pays for inference)

Shaun Smith `@evalstate`

Agent Client Protocol
Responses API

`https://github.com/huggingface/upskill`

MCP makes it easy to transfer generation and execution between models and environments!
(and who pays for inference)