Prompt Engineering for Production Systems

Moving prompt engineering from Jupyter notebooks into production demands a shift from trial-and-error strings to deterministic systems. When users rely on your AI features, an output format error or a hallucinations spike translates directly into application crashes and customer churn.

In this deep dive, we will explore the three pillars of production prompting:

Structured JSON Outputs with schema enforcement.
The compilation model of prompts using DSPy.
Establishing systematic assertion loops.

---

1. Enforcing Structured Output

In production, you should never parse raw text blocks using regex or string splits. You need the LLM to output predictable structures. Using TypeScript and Zod, we can define schema interfaces and leverage libraries like Instructor to force output consistency.

Here is a typical production pattern for validating LLM output using Zod:

typescript

import { z } from "zod";

// 1. Define the desired output structure
export const SentimentAnalysisSchema = z.object({
  sentiment: z.enum(["positive", "neutral", "negative"]),
  confidenceScore: z.number().min(0).max(1),
  keyEntities: z.array(z.string()).describe("List of core topics mentioned"),
  summary: z.string().describe("A concise 1-sentence recap"),
  needsCustomerSupport: z.boolean().describe("True if customer seems angry or frustrated")
});

export type SentimentAnalysis = z.infer<typeof SentimentAnalysisSchema>;

Pro Tip: Always add `.describe()` tags to your Zod keys. Models use these descriptions during structured tool calls as explicit semantic guides on what data to put inside each field.

---

2. DSPy: Compiling Prompts

Instead of manually editing prompt strings, DSPy introduces a programmatic approach. It models prompt generation as an optimization problem:

Signatures: Define what the inputs and outputs are (e.g., `question -> answer`).
Modules: Assemble pipelines (e.g., `ChainOfThought`, `ReAct`).
Teleprompters (Optimizers): Optimize prompts by compiling them against a set of examples.

Here is a conceptual comparison of standard prompting vs. DSPy program design:

Feature	Ad-Hoc Prompting	DSPy Compilation
Updates	Manual rewrite	Re-run compiler with new training data
Model Porting	Often breaks; requires tuning	Seamless; optimizer adjusts signatures
System Flow	Hard to trace	Programmatic pipelines

---

3. Creating Assertion Loops

Sometimes, structured outputs are syntactically correct but semantically invalid. For example, the JSON parses, but the summary is empty or contains forbidden phrases.

To prevent this, deploy Assertion Loops in your API middleware:

typescript

async function generateValidatedSentiment(review: string) {
  let attempts = 0;
  const maxAttempts = 3;
  let feedback = "";

  while (attempts < maxAttempts) {
    const prompt = getPromptWithFeedback(review, feedback);
    const result = await callLLM(prompt);
    
    const parsed = SentimentAnalysisSchema.safeParse(result);
    if (parsed.success) {
      // Semantic check: ensure confidence score matches sentiment logic
      if (parsed.data.sentiment === "negative" && parsed.data.confidenceScore < 0.4) {
        feedback = "Sentiment was marked negative, but confidence score is too low. Re-evaluate.";
        attempts++;
        continue;
      }
      return parsed.data;
    }
    
    feedback = `Failed schema validation. Errors: ${parsed.error.message}`;
    attempts++;
  }
  
  throw new Error("Failed to generate valid output after maximum attempts");
}

By wrapping API prompts with strict Zod schema parsing and feedback loops, you eliminate over 99% of formatting failures.

Prompt Engineering for Production Systems

Amarjit Singh

Prompt Engineering for Production Systems

1. Enforcing Structured Output

2. DSPy: Compiling Prompts

3. Creating Assertion Loops

Related Insights

Advanced RAG Architecture & Optimization

AI Red-Teaming: Defeating LLM Security Controls

Command Palette

Prompt Engineering for Production Systems

Amarjit Singh

Prompt Engineering for Production Systems

1. Enforcing Structured Output

2. DSPy: Compiling Prompts

3. Creating Assertion Loops

Related Insights

Advanced RAG Architecture & Optimization

AI Red-Teaming: Defeating LLM Security Controls