<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Tutorials on Krypton Runtime</title><link>https://www.kryptonhq.com/docs/tutorials/</link><description>Recent content in Tutorials on Krypton Runtime</description><generator>Hugo</generator><language>en-us</language><atom:link href="https://www.kryptonhq.com/docs/tutorials/index.xml" rel="self" type="application/rss+xml"/><item><title>Deploying your first Agent</title><link>https://www.kryptonhq.com/docs/tutorials/first-agent/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://www.kryptonhq.com/docs/tutorials/first-agent/</guid><description>&lt;p&gt;This guide deploys a custom agent into a running Krypton cluster,
verifies the gateway route, and shows where scaling signals come from.
Krypton treats your container as a black box that speaks A2A, MCP, or
plain HTTP.&lt;/p&gt;
&lt;h2 id="ports--endpoints--the-two-minute-mental-model"&gt;Ports &amp;amp; endpoints — the two-minute mental model&lt;/h2&gt;
&lt;p&gt;Krypton exposes &lt;strong&gt;two&lt;/strong&gt; HTTP services. Talk to the right one:&lt;/p&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th style="text-align: right"&gt;Port&lt;/th&gt;
 &lt;th&gt;Service&lt;/th&gt;
 &lt;th&gt;What lives here&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td style="text-align: right"&gt;&lt;strong&gt;8080&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;&lt;strong&gt;Gateway&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;All agent traffic. Anything under &lt;code&gt;/v1/agents/{ns}/{name}/*&lt;/code&gt; is reverse-proxied to your pod — the protocol RPC at &lt;code&gt;/&lt;/code&gt;, the A2A card at &lt;code&gt;/.well-known/agent-card.json&lt;/code&gt;, OAuth callbacks at &lt;code&gt;/oauth/...&lt;/code&gt;, MCP SSE streams, anything. The gateway strips the &lt;code&gt;/v1/agents/{ns}/{name}&lt;/code&gt; prefix; your container sees the original sub-path.&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td style="text-align: right"&gt;&lt;strong&gt;8090&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;&lt;strong&gt;Control plane&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Web UI and introspection APIs (&lt;code&gt;/v1/agents&lt;/code&gt;, &lt;code&gt;/v1/agents/{ns}/{name}/mcp/tools&lt;/code&gt;, &lt;code&gt;/v1/agents/{ns}/{name}/status&lt;/code&gt;). Operator tooling only — &lt;strong&gt;never&lt;/strong&gt; the path your clients use to invoke an agent.&lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;Rule of thumb: if it&amp;rsquo;s a normal A2A / MCP / HTTP client, point it at
&lt;code&gt;:8080&lt;/code&gt;. If it&amp;rsquo;s a browser or &lt;code&gt;kubectl&lt;/code&gt;-adjacent tool, &lt;code&gt;:8090&lt;/code&gt;.&lt;/p&gt;</description></item><item><title>Deploy Your First LLM</title><link>https://www.kryptonhq.com/docs/tutorials/first-llm/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://www.kryptonhq.com/docs/tutorials/first-llm/</guid><description>&lt;p&gt;Krypton lets you serve an LLM the same way you manage the rest of your
cluster: declare a resource, let the controller create the workload, and
send traffic through the gateway.&lt;/p&gt;
&lt;p&gt;Krypton serves Hugging Face GGUF models with &lt;a href="https://github.com/ggerganov/llama.cpp"&gt;llama.cpp&lt;/a&gt;. A
&lt;code&gt;Model&lt;/code&gt; resource points at a repo and file, the controller creates a
Deployment and Service, and Krypton exposes the model through
OpenAI-compatible endpoints:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;GET /v1/models&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;POST /v1/chat/completions&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;POST /v1/completions&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;POST /v1/embeddings&lt;/code&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Any OpenAI SDK can use the gateway as its &lt;code&gt;base_url&lt;/code&gt;.&lt;/p&gt;</description></item></channel></rss>