The Ragwalla Blog

A blog about building and using an OpenAI Assistants compatible service

Key Differences Between the OpenAI Responses API and the Assistants API

​The OpenAI Responses API simplifies conversation state management by handling it server-side. Developers can maintain context across interactions by including the `previous_response_id` parameter in their requests, referencing the last response's ID. This approach eliminates the need to manually track conversation history, as the API retrieves and incorporates the entire conversation chain automatically. However, it's important to note that all previous input tokens in the conversation chain are billed as input tokens.

Hybrid Search With The OpenAI Assistants API

The OpenAI Assistants API does not natively support hybrid search or metadata indexing, but Ragwalla's implementation does. Hybrid search allows developers to build smarter, faster, and more precise data retrieval systems. Whether manually setting keywords or automating the process with LLM Auto mode, this approach enhances the effectiveness and efficiency of vector-based applications.

Real Use Cases for Cosine Similarity, Dot Product, and Euclidean Distance

Understanding cosine similarity, dot product, and Euclidean distance can be much easier with real-world analogies. These measures each capture “similarity” or “distance” in different ways — direction vs. magnitude vs. straight-line distance . We explore two narrative-style scenarios for each measure, showing when to use them and why.