Ragwalla Memory and Knowledge Graphs are now generally available!

Relace Models

Explore the Relace language and embedding models available through our OpenAI Assistants API-compatible service.

Relace: Relace Apply 3

Context Length:: 256,000 tokens
Architecture:: text->text
Max Output:: 128,000 tokens

Pricing:

Prompt: $0.00000085

Completion: $0.00000125

Relace Apply 3 is a specialized code-patching LLM that merges AI-suggested edits straight into your source files. It can apply updates from GPT-4o, Claude, and others into your files at 7,500 tokens/sec on average.

The model requires the prompt to be in the following format:
{instruction}
{initial_code}
{edit_snippet}

Zero Data Retention is enabled for Relace. Learn more about this model in their documentation

Ready to build with Relace?

Start using these powerful models in your applications with our flexible pricing plans.

View Pricing