AiMedley is one home for DeepSeek, Qwen, Kimi, and Llama — four open-source-adjacent models with very different personalities. Chat with one, compare all four, research with the wisdom of crowds, build agents, analyze documents.
Closed labs each ship their own chat app. Open-source-adjacent models — DeepSeek, Qwen, Kimi, Llama — never had a polished home where you could chat, compare, and build with all of them. Until now.
The right model depends on the prompt. AiMedley turns picking the right mind into a one-click side-by-side experiment — streaming live across all 4 columns at once.
Most AI apps look like spreadsheets. AiMedley feels like a toy you actually want to touch — neo-brutalist clay, color-coded personalities, and starter prompts tuned to each model's vibe.
Conversations, agents, and documents are scoped to your account. Your OpenRouter key is yours. We don't train on your prompts. We don't sell your data. We don't have ads.
Math, code, and step-by-step reasoning. The one you want when you need the answer to actually be right.
Fast, direct, and a little spicy. Great for brainstorming, naming, copy, and rapid back-and-forth.
Long context and lyrical prose. Best for storytelling, ideation, and content with character.
Warm, practical, and helpful. The friendly generalist — the one you'd ask for dinner ideas.
Typewriter-fast SSE token streaming, per-model temperature and length controls.
Send one prompt → watch all 4 models type their answers live in parallel columns.
Save reusable personas with system prompts + default models. Pick from starter templates or build from scratch.
Multi-model parallel briefing + a final synthesized digest. Optional web search with URL citations.
Upload PDF / text / code (≤5MB) and ask any of the 4 models grounded questions.
Flip a toggle and any model gets fresh web results (via OpenRouter's Exa plugin) with citations.
AiMedley routes every request through OpenRouter — a single API that unifies DeepSeek, Qwen, Kimi (Moonshot) and Llama. You pay only for what you use, at near-cost.
Single chats and the multi-model compare view both use Server-Sent Events. For compare we fan-out 4 streams server-side and tag every token with its model so all 4 columns type live.
After you send your first message we fire a cheap background call to summarize the chat into a 3-5 word title. No more sidebars full of 'Hey, how do I…'.
Flip the green Globe toggle on Chat / Compare / Research and any model gets fresh web results via OpenRouter's Exa plugin, complete with citation URLs.
Temperature and max-tokens are per-model and saved in your browser. Want Qwen wild and DeepSeek precise? Two clicks, persists forever.
Spin up the workbench in 10 seconds. No card. No setup. Bring your own OpenRouter key (or borrow ours in demo mode).
Open the workbench