top of page
Blogs


Which Vector Search Should I Use? A Practical Guide
Modern AI applications from chatbots and recommendation systems to image search rely on searching high‑dimensional vectors. A well‑chosen index makes these searches fast and memory‑efficient, but there are many options. This guide answers a simple yet important question: which vector search index should you use? We start with a gentle introduction to vector search and indexing, then explore the major index types: Hierarchical Navigable Small World (HNSW), Inverted File (IVF),
Varad Khonde
8 min read
Â
Â
Â


5 Learnings from developing AI agents in production
Trying to deploy your agent to production? Here are 5 detailed learnings from the Newtuple team on what works, what doesn't, and practical tips based on our experience
Dhiraj Nambiar
14 min read
Â
Â
Â


AI Agents: Bridging the Gap Between Hype and Reality
There's a lot of hype around AI agents today, let us decipher the hype from the reality
Newtuple Marketing Team
5 min read
Â
Â
Â


The Hidden Cost of Bad Prompts: Why Typos and Sloppy Formatting Sabotage Your Business AI
Do Typos in your prompts matter? More than you think!
Newtuple Marketing Team
3 min read
Â
Â
Â


Finance-Friendly OCR: How Docling, Dolphin & Others Tackle Wall Street PDFs
This article compares a few prominent Open source OCR libraries for document extraction for your AI use cases.
Shikhar Jha
6 min read
Â
Â
Â


Run Deepseek R1 on your laptop in 5 minutes or less
Run Deepseek R1 on your laptop in 5 minutes or less
Rahul Kumar
1 min read
Â
Â
Â


How to install and use ollama for hosting LLMs locally
A 5 minute setup guide for Ollama
Shikhar Jha
2 min read
Â
Â
Â


Introducing our Generative AI Accelerators
This is an introduction into our Generative AI accelerators, helping teams develop better production grade applications at a fraction of the time
Dhiraj Nambiar
3 min read
Â
Â
Â


From Prototype to Production: Best Practices for Building Production‑Ready AI Systems
From Markdown inputs to load-balanced LLM endpoints, this guide walks through key lessons learned while taking a financial AI prototype into production. Learn how structured prompts, JSON validation, exponential backoff, batching, observability tools, and smart frameworks like LiteLLM can transform your system into a scalable, resilient AI service.
Rahul Kumar
11 min read
Â
Â
Â


How to Tackle Model Interoperability
LLM interoperability isn't just swapping APIs—each model has unique quirks. True agility demands rigorous testing, not plug-and-play hype.
Dhiraj Nambiar
8 min read
Â
Â
Â


Lost in Translation? Boosting Accuracy in PDF-Based Data Extraction using LLMs
Transform raw PDFs into structured Markdown to boost LLM accuracy in extracting actionable data from complex, tabular documents.
Aniket Kulkarni
3 min read
Â
Â
Â


How AI-Driven Software Development is Transforming the Coding and IT Industries
LLMs revolutionize coding & IT projects—faster dev, better debugging, smarter docs. Harness AI while mitigating risks!
Sameer Kankute
4 min read
Â
Â
Â


Bridging the Gap: How Hybrid Chatbots are Redefining Intelligent Conversations
Hybrid chatbots blend structured workflows with AI-powered agents, delivering intelligent and scalable conversations for businesses.
Shivkumar Salunkhe
7 min read
Â
Â
Â


Comparing Reasoning models: DeepSeek vs Gemini vs OAI
Comparing DeepSeek R1, Gemini 2.0 Flash, Gemini 1.5 Pro, o1, and o1-mini on coding challenges—who excels in reasoning and optimization?
Praharshita Kulkarni
5 min read
Â
Â
Â


AI Meets Sales: Analyzing B2B Sales Calls with AI
Boost B2B sales with AI-powered call analysis. Discover insights on metrics, sentiment, and features to enhance strategies and conversions.
Rahul Dalvi
4 min read
Â
Â
Â


AWS Expands Its AI Frontier: Introducing Amazon Nova and Enhanced Bedrock Guardrails
AWS’s Amazon Nova models and enhanced Bedrock Guardrails deliver cost-effective, high-performance genAI with responsible, secure deployment.
Praharshita Kulkarni
5 min read
Â
Â
Â


Speed and Scalability in Vector Search
Which vector database should you choose when you're creating your application? Here are some comparisons for you to understand better
Sameer Kankute
6 min read
Â
Â
Â


How OpenAI implements Structured Outputs
How does OpenAI implement structured outputs in it's latest model? Read more to find out
Sameer Kankute
3 min read
Â
Â
Â


Temporal accuracy and LLMs
Earlier this month a new research paper was released, which extensively tested capabilities of Large Language Models (LLMs) at a...
Dhiraj Nambiar
5 min read
Â
Â
Â


Master the Art of Data Streamlining with Pub/Sub Pipelines
Data pipeline Creating a pipeline that requires sending some events to be processed in the background keeping scalability in mind is one of the key requirements for building applications today. In order to process heavy/time consuming jobs in your current application it’s generally better to separate it from your core application, and one way to achieve is to use a pub/sub architecture. The idea here is as follows: Maintain a queue of messages A publisher pushes messages to t
Rahul Kumar
2 min read
Â
Â
Â
bottom of page