Engineering Insights

Our Blog

Insights, tutorials, and industry analysis from our team of engineers and consultants.

FeaturedSoftware Craftsmanship

How to Vibe Code with AI Agents and Actually Ship Production-Ready Features

Spec Driven Development means you write a specification - a clear, structured description of what you're building - before any code is written.

Manav Oza

2026-05-19

5 min read

Read Article

Frontend Development

Quick Walkaround - Next.js 16.2: AI Improvements

Next.js 16.2 just dropped and it's a big one for anyone working with AI coding agents. Here's what actually shipped, what it means for your projects, and how to adopt it today.

Why Session Affinity Is Important When Configuring a Load Balancer

A practical guide with Nginx setup Introduction If you've ever scaled a backend service behind a load balancer and suddenly found users getting logged out, WebSocket connections dropping, or inconsistent API responses - you've likely hit a session affinity problem. In this article, I'll explain why...

Manav Oza

2026-04-21

3 min read

Engineering Case Studies

How FASTag Actually Works - A Backend Deep Dive

Every time you pass a toll, a distributed payment transaction runs across five independent systems, two banks, and NPCI's central switch - and the barrier lifts before you've finished blinking. That's roughly 9.5 million times a day.

IaC - Infrastructure as Code: Stop Clicking. Start Coding.

Most dev teams manually configure servers, click through cloud consoles, and write "tribal knowledge" wikis nobody reads. Then someone changes one setting, and the entire deployment breaks.

Manav Oza

2026-04-14

3 min read

AI & Machine Learning

What Are Vector Embeddings? A Developer's Guide

A vector embedding is a numerical representation of data, think of it as translating words, images, or any piece of content into a list of numbers (a vector) that captures its meaning.

Manav Oza

2026-03-12

4 min read

AI & Machine Learning

RAG Explained Simply (With Workflows You Can Actually Build Today)

RAG (Retrieval-Augmented Generation) fixes this by injecting the right context into the prompt at query time, instead of baking it into the model at training time.

Manav Oza

2026-03-10

5 min read

AI & Machine Learning

Prompt Caching — Cut Your AI API Costs by 90%

If the same prompt is sent multiple times, reuse the previous response instead of calling the API again.

Manav Oza

2026-03-05

3 min read

AI & Machine Learning

Building Your First MCP Server (Model Context Protocol)

The rise of AI agents has shifted how we build software. Instead of just APIs, we now design context-aware systems that can communicate intelligently with LLMs.

Manav Oza

2026-03-03

2 min read

AI & Machine Learning

Temperature & Top_p in Language Models: The Two Knobs Every AI Engineer Must Understand

You can write perfect prompts and still get bad outputs — if you don't understand these two parameters.

Manav Oza

2026-02-26

6 min read

AI & Machine Learning

Writing an Effective Prompt: A Developer's Guide to Getting the Best Out of AI

If you've been treating AI prompts like Google search queries, you're leaving 80% of the value on the table.

Manav Oza

2026-02-24

6 min read

AI & Machine Learning

The Art of Context Window Optimization

You're in a conversation with an AI. You ask it a question. It gives you a mediocre answer. So you paste in more information, more examples, more context. Then another 200 characters. Then a whole code file. And somehow... the answer gets worse, not better.

Manav Oza

2026-02-19

5 min read

AI & Machine Learning

Understanding Tokens and Tokenization: The Foundation of Modern LLMs

When you interact with ChatGPT, Claude, or any other large language model, there's a fundamental process happening behind the scenes that most users never think about: tokenization. Understanding this concept isn't just academic—it directly impacts your API costs, prompt design, and how effectively you can leverage AI in your applications.

Manav Oza

2026-02-17

4 min read

Test Driven Development

Stop Chasing 100% Code Coverage: A Better Testing Strategy for Jest

As engineering leaders and developers, we've all been there—staring at a coverage report showing 98% and feeling that nagging urge to squeeze out those last two percentage points. But here's the uncomfortable truth: 100% code coverage doesn't guarantee 100% bug-free code.

Manav Oza

2026-02-12

3 min read

Test Driven Development

Understanding Gherkin Feature Files: Making Test Scenarios Human-Readable

When building end-to-end test automation, one of the biggest challenges teams face is the communication gap between technical and non-technical stakeholders. Product managers, QA analysts, and developers often struggle to speak the same language when discussing test scenarios. This is where Gherkin feature files come in.

AWS Cognito: The Complete Guide to Getting It Right the First Time

AWS Cognito is a fully managed authentication, authorization, and user management service. It handles user sign-up, sign-in, account recovery, and access control without requiring you to build or maintain authentication infrastructure.

FlashList vs FlatList: A Deep Dive into React Native's Performance Revolution

If you're building React Native applications with lengthy scrollable lists, you've likely encountered performance bottlenecks. Shopify's FlashList emerged as a solution to these challenges, promising up to 10x better performance than the traditional FlatList. Let's explore what makes FlashList different and when you should adopt it in your projects.

Why Semantic HTML Still Matters in Modern Web Development

Semantic HTML is one of those fundamentals that pays dividends across your entire stack, from SEO to accessibility to long-term maintainability. Let me break down why it deserves your attention, even when deadlines are tight.

Framer Motion Quickstart: Bring Your React Apps to Life

Animations can make or break user experience. They guide attention, provide feedback, and make interfaces feel responsive and alive. But let's be honest—implementing smooth, performant animations in React has traditionally been a pain.

Understanding LEFT JOINs in SQL: A Complete Guide

A LEFT JOIN (also called LEFT OUTER JOIN) returns all records from the left table and the matched records from the right table. If there's no match, the result will still include the row from the left table, but with NULL values for columns from the right table.

Correlation IDs: The Unsung Hero of Distributed Systems

In the world of distributed systems, where requests traverse multiple services and components, understanding the flow of a request can be challenging. This is where correlation IDs come into play—a simple yet powerful technique to trace and debug requests across system boundaries.

Manav Oza

2026-01-19

4 min read

Application Development

Offline-First Mobile Apps: Building Resilient Apps with PowerSync and React Native

Have you ever used an app and immediately felt frustrated because it didn't work without an internet connection? Today's users expect their mobile apps to work seamlessly whether they're on a strong WiFi connection, a slow 4G network, or completely offline. This is where offline-first architecture comes in—a paradigm shift that prioritizes local data storage and synchronization to deliver superior user experiences.

Complete Guide to Rate Limiting in Node.js: Express & NestJS

Rate limiting is a critical security mechanism that controls how many requests a client can make to your API within a specified time window. Without proper rate limiting, your application becomes vulnerable to abuse, DDoS attacks, and resource exhaustion.

Quickstart with Knex - Query Builder for JavaScript

Imagine you're building a JavaScript application that needs to talk to a database. You could write SQL queries as strings, but that gets messy fast. Knex is like having a translator that converts clean JavaScript code into proper SQL queries.

CStreaming UI: Building Real-Time Interfaces with Server Components + Suspense

Picture this: A user lands on your e-commerce dashboard. Instead of staring at a loading spinner for 3 seconds while everything loads, they immediately see the header, then the navigation appears, followed by product cards streaming in as they're ready. This isn't magic—it's the power of Server Components combined with Suspense.

Crafting Frontend with Atomic Design Pattern

Picture this: You're building a house. Would you start by constructing entire rooms, or would you first ensure you have quality bricks, mortar, windows, and doors? Just like in construction, building scalable frontend applications requires starting with the smallest, most fundamental pieces and combining them into increasingly complex structures.

Manav Oza

2025-09-05

8 min read

AI & Machine Learning

Perplexity Comet Browser: A Comprehensive Developer’s Guide for 2025

In an era of information overload and rapid technological innovation, the Perplexity Comet Browser stands apart as an AI-native web platform crafted for the next generation of human-Internet interaction. Launched in May 2025, Comet is designed to revolutionize not just how we browse, but how we learn, think, build, and act online—making it a compelling tool for developers of all backgrounds.

Manav Oza

2025-08-29

10 min read