
gurobipy Tutorial: The Gurobi Python API Explained

If you've ever tried to solve an optimization problem in production, you've probably discovered that the Gurobi Python API is both powerful and intimidating. Gurobi is the gold standard for mathematical optimization, but its Python interface—gurobipy—can feel like drinking from a fire hose. You're trying to build models, but you're wrestling with environments, license tokens, and mysterious infeasibility errors.

Here's the thing: most gurobipy tutorials show you toy problems with five variables. Real optimization models have thousands of variables, complex constraints, and need to run reliably in production. This tutorial bridges that gap. We'll start with the basics, but we'll focus on patterns that actually work when your optimization problems grow up.

Installing gurobipy and Getting Your License Right

The good news: installing gurobipy is straightforward. The bad news: licensing is where most people get stuck.

pip install gurobipy

That's it for installation. Starting with Gurobi 9.1, gurobipy comes with a limited license that works for small problems (up to 2000 variables and constraints). For anything larger, you need a proper license.

License Setup Reality Check

Here's what the documentation won't tell you: Gurobi licensing in production is a procurement nightmare. You have node-locked licenses (tied to specific machines), floating licenses (shared across a network), token servers, Web License Service (WLS) for containers, and academic vs commercial tiers. Each has different deployment implications.

For local development, the academic license is generous—unlimited problem size for academic use. For production, you're typically looking at floating licenses managed by a license server, or increasingly, WLS tokens for cloud deployments.
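
For reference, a WLS client license is just a small gurobi.lic file with three keys—no license server daemon to run, which is why it works well in containers. The values below are placeholders, not real credentials:

```text
# gurobi.lic for the Web License Service (placeholder values)
WLSACCESSID=xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx
WLSSECRET=xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx
LICENSEID=123456
```

Gurobi looks for this file in standard locations (home directory, or wherever GRB_LICENSE_FILE points), so shipping it as a mounted secret is usually all a container deployment needs.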

The critical insight: license acquisition happens at environment creation, not model creation. This means you want to reuse environments when possible to avoid repeatedly hitting your license server.

Building Your First Model: Beyond the Textbook Example

Most tutorials start with the diet problem or some other toy example. Let's build something closer to reality: a supply chain optimization model where you're deciding how much inventory to hold at different warehouses.

import gurobipy as gp
from gurobipy import GRB

# This import pattern is canonical - gp.* for functions, GRB.* for constants
with gp.Env() as env, gp.Model(env=env) as model:
    
    # Decision variables: how much inventory at each warehouse
    warehouses = ['NYC', 'LA', 'Chicago', 'Houston']
    products = ['Widget_A', 'Widget_B', 'Widget_C']
    
    # Create variables with meaningful names
    inventory = model.addVars(warehouses, products, 
                            name="inventory", 
                            lb=0,  # Non-negative inventory
                            vtype=GRB.CONTINUOUS)
    
    # Constraints: capacity limits
    capacity = {'NYC': 1000, 'LA': 1500, 'Chicago': 1200, 'Houston': 800}

    model.addConstrs(
        (inventory.sum(warehouse, '*') <= capacity[warehouse] for warehouse in warehouses),
        name="capacity"
    )

    # Minimum network-wide stock per product (illustrative numbers).
    # Without a floor like this, the cost-minimizing answer is trivially
    # zero inventory everywhere and the results loop below prints nothing.
    min_stock = {'Widget_A': 500, 'Widget_B': 400, 'Widget_C': 300}

    model.addConstrs(
        (inventory.sum('*', product) >= min_stock[product] for product in products),
        name="min_stock"
    )

    # Objective: minimize holding costs
    holding_cost = {
        ('NYC', 'Widget_A'): 2.5, ('NYC', 'Widget_B'): 1.8, ('NYC', 'Widget_C'): 3.2,
        ('LA', 'Widget_A'): 2.1, ('LA', 'Widget_B'): 1.9, ('LA', 'Widget_C'): 2.8,
        ('Chicago', 'Widget_A'): 2.3, ('Chicago', 'Widget_B'): 2.0, ('Chicago', 'Widget_C'): 3.0,
        ('Houston', 'Widget_A'): 2.4, ('Houston', 'Widget_B'): 2.2, ('Houston', 'Widget_C'): 2.9,
    }
    
    objective = gp.quicksum(holding_cost.get((w, p), 0) * inventory[w, p] 
                           for w in warehouses for p in products)
    
    model.setObjective(objective, GRB.MINIMIZE)
    
    # Solve
    model.optimize()
    
    # Extract results
    if model.status == GRB.OPTIMAL:
        for w in warehouses:
            for p in products:
                if inventory[w, p].x > 0.01:  # Only print non-zero values
                    print(f"Hold {inventory[w, p].x:.2f} units of {p} in {w}")

Notice the context manager pattern (with gp.Env() as env). This is crucial for production code—it ensures that license tokens are properly returned when you're done, whether your code succeeds or fails.

Variables, Constraints, and Objectives: API Patterns That Scale

The gurobipy API has evolved over the years, and there are now multiple ways to do everything. Here are the patterns that work best for real models:

Variable Creation Patterns

# Don't do this for large models:
x1 = model.addVar(name="x1")
x2 = model.addVar(name="x2")
# ... repeat 10,000 times

# Do this instead:
variables = model.addVars(range(10000), name="x")

# Or with meaningful indices:
production = model.addVars(plants, products, months, name="production")

# With bounds and types specified upfront:
binary_decisions = model.addVars(locations, vtype=GRB.BINARY, name="open")

Constraint Patterns That Don't Break

The addConstrs() method is your friend for adding many similar constraints:

# Production capacity constraints
model.addConstrs(
    (production.sum(plant, '*', month) <= plant_capacity[plant, month]
     for plant in plants for month in months),
    name="capacity"
)

# Demand satisfaction
model.addConstrs(
    (production.sum('*', product, month) >= demand[product, month]
     for product in products for month in months),
    name="demand"
)

Objective Function Gotchas

Gurobi supports linear and quadratic objectives. The quicksum() function is more efficient than Python's built-in sum() for large expressions:

# Inefficient for large models:
objective = sum(cost[i] * variables[i] for i in range(10000))

# Better:
objective = gp.quicksum(cost[i] * variables[i] for i in range(10000))

# Best for structured problems:
objective = production.prod(production_cost) + inventory.prod(holding_cost)
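
To see where the sum() penalty comes from, here's a pure-Python toy—ToyExpr is illustrative, not part of gurobipy—that mimics an immutable expression whose + operator copies every term, the way repeated `expr = expr + term` behaves inside sum(), versus quicksum-style in-place accumulation:

```python
class ToyExpr:
    """Toy linear expression; NOT gurobipy API, just a cost model."""
    moves = 0  # class-wide counter of term copies/moves

    def __init__(self, terms=None):
        self.terms = list(terms) if terms is not None else []

    def __add__(self, other):
        # Copies BOTH operands' terms into a fresh expression
        ToyExpr.moves += len(self.terms) + len(other.terms)
        return ToyExpr(self.terms + other.terms)

    def iadd(self, other):
        # quicksum-style in-place accumulation: move only the new terms
        ToyExpr.moves += len(other.terms)
        self.terms.extend(other.terms)
        return self


n = 200
exprs = [ToyExpr([i]) for i in range(n)]

ToyExpr.moves = 0
total = ToyExpr()
for e in exprs:
    total = total + e          # sum()-style fold
slow_moves = ToyExpr.moves     # 1 + 2 + ... + n = n*(n+1)/2 = 20100

ToyExpr.moves = 0
total = ToyExpr()
for e in exprs:
    total.iadd(e)              # quicksum-style accumulation
fast_moves = ToyExpr.moves     # exactly n = 200

print(slow_moves, fast_moves)  # 20100 200
```

The fold does O(n²) term copies while in-place accumulation does O(n)—at 10,000 variables that gap is the "seconds vs minutes" difference in model build time.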

Reading Results and Diagnosing When Things Go Wrong

Here's where most tutorials fail you. Real optimization models don't always solve cleanly. Let's talk about what happens when they don't.

The Status Code Reality

model.optimize()

status_map = {
    GRB.OPTIMAL: "Optimal solution found",
    GRB.INFEASIBLE: "Model is infeasible", 
    GRB.UNBOUNDED: "Model is unbounded",
    GRB.INF_OR_UNBD: "Model is infeasible or unbounded",
    GRB.TIME_LIMIT: "Time limit reached",
    GRB.INTERRUPTED: "Optimization interrupted"
}

print(f"Status: {status_map.get(model.status, 'Unknown status')}")

if model.status == GRB.OPTIMAL:
    print(f"Objective value: {model.objVal}")
    # Extract variable values
    for var in model.getVars():
        if var.x > 1e-6:  # Only show non-zero values
            print(f"{var.varName}: {var.x}")

Infeasibility: The Debugging Nightmare

When your model is infeasible, Gurobi can compute an Irreducible Inconsistent Subsystem (IIS)—a minimal subset of constraints that make the model impossible to solve:

if model.status == GRB.INFEASIBLE:
    print("Model is infeasible. Computing IIS...")
    model.computeIIS()
    
    print("\nConstraints in IIS:")
    for constr in model.getConstrs():
        if constr.IISConstr:
            print(f"  {constr.constrName}")
    
    print("\nVariable bounds in IIS:")
    for var in model.getVars():
        if var.IISLB:
            print(f"  {var.varName} >= {var.lb}")
        if var.IISUB:
            print(f"  {var.varName} <= {var.ub}")

Pro tip: If you get "Cannot compute IIS on a feasible model" but your model status is INFEASIBLE, you're hitting a numerical edge case. Try tightening the feasibility tolerance (the FeasibilityTol parameter) or adjusting presolve settings.

Parameters and Tuning: Making Gurobi Work for Your Problem

Gurobi has over 100 parameters. Most you'll never touch, but a few can dramatically affect performance:

Essential Parameters for Production

# Time limits (essential for production systems)
model.setParam('TimeLimit', 300)  # 5 minutes max

# Threading (be careful in containerized environments)
model.setParam('Threads', 4)

# MIP gap tolerance (when "good enough" is good enough)
model.setParam('MIPGap', 0.01)  # 1% gap

# Logging (crucial for debugging)
model.setParam('LogToConsole', 1)
model.setParam('LogFile', 'gurobi.log')

# Presolve aggressiveness
model.setParam('Presolve', 2)  # Aggressive presolving

When Models Won't Solve: The Parameter Tuning Dance

For hard models, Gurobi includes a tuning tool that automatically searches for better parameter settings:

# Create a model, then tune it
model.tune()

if model.tuneResultCount > 0:
    model.getTuneResult(0)  # Get the best parameter set
    model.write('tuned_params.prm')  # Save for later use

The tuning tool reports a Pareto frontier—different parameter sets that achieve the best trade-off between runtime and solution quality. In production, you often want the parameter set that gives you a "good enough" solution quickly, not necessarily the optimal solution eventually.
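
The .prm file itself is plain text—one "ParameterName value" pair per line—so you can inspect it and check it into version control. The parameters below are illustrative values, not actual tuner output:

```text
# tuned_params.prm - one "ParameterName value" pair per line
MIPFocus 1
Heuristics 0.05
Presolve 2
Cuts 2
```

Load it back in a later run with model.read('tuned_params.prm') before calling optimize().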

Callbacks: When You Need Custom Logic

Callbacks let you interrupt Gurobi's solution process to inject custom logic. The most common use case: implementing custom cut generation or heuristics.

def callback_function(model, where):
    if where == GRB.Callback.MIPSOL:
        # Called when a new integer solution is found
        model_vars = model.getVars()
        solution_values = model.cbGetSolution(model_vars)

        # Check if solution meets custom business logic
        # (violates_business_rule is a user-supplied predicate, not shown)
        if violates_business_rule(solution_values):
            # Add a lazy constraint to cut off this solution
            selected_vars = [v for v, s in zip(model_vars, solution_values) if s > 0.5]
            if selected_vars:
                model.cbLazy(gp.quicksum(selected_vars) <= len(selected_vars) - 1)

    elif where == GRB.Callback.MIP:
        # Called periodically during MIP solve
        obj_best = model.cbGet(GRB.Callback.MIP_OBJBST)
        obj_bound = model.cbGet(GRB.Callback.MIP_OBJBND)

        # Custom termination criteria (avoid division by zero)
        if obj_best != 0 and abs(obj_best - obj_bound) / abs(obj_best) < 0.02:  # 2% gap
            model.terminate()

# Enable lazy constraints and the callback
model.setParam('LazyConstraints', 1)
model.optimize(callback_function)

Callback gotcha: When using multiple threads, the callback is still called from a single thread, so you don't need to worry about thread safety. But if you started the solve asynchronously with optimizeAsync(), the callback runs in a background thread.

Moving from Prototype to Production: The Hard Truths

Here's what changes when your optimization model graduates from Jupyter notebook to production system:

Environment Management at Scale

Don't create new environments for every solve. Reuse them:

class OptimizationService:
    def __init__(self):
        # Create environment once
        self.env = gp.Env()
        self.env.setParam('LogToConsole', 0)  # Suppress console output

    def solve_model(self, data):
        # Reuse the same environment
        with gp.Model(env=self.env) as model:
            # Build and solve model
            pass

    def close(self):
        # Clean up environment - call this explicitly when done
        if hasattr(self, 'env'):
            self.env.dispose()

Error Handling That Won't Crash Production

import time

# Define custom exceptions for optimization errors
class OptimizationError(Exception):
    pass

class InfeasibleModelError(OptimizationError):
    pass

def robust_optimize(model, max_retries=3):
    for attempt in range(max_retries):
        try:
            model.optimize()

            if model.status in [GRB.OPTIMAL, GRB.TIME_LIMIT]:
                return model
            elif model.status == GRB.INFEASIBLE:
                raise InfeasibleModelError("Model has no feasible solution")
            else:
                raise OptimizationError(f"Unexpected status: {model.status}")

        except gp.GurobiError as e:
            if attempt == max_retries - 1:
                raise
            # Wait and retry for transient errors
            time.sleep(2 ** attempt)

Performance Patterns

Large models require different patterns than small ones:

# Pre-allocate constraint matrices when possible (matrix API)
model.addMConstr(A_matrix, variables, GRB.LESS_EQUAL, b_vector)

# Use model.copy() for similar models instead of rebuilding
base_model = create_base_model()
daily_models = [base_model.copy() for _ in range(30)]

# Warm starts from previous solutions
if previous_solution:
    for var_name, value in previous_solution.items():
        model.getVarByName(var_name).start = value

FAQ

What's the difference between sum() and quicksum() in gurobipy?

quicksum() is Gurobi's optimized summation function that's much faster for large expressions. Python's built-in sum() creates intermediate objects for each addition, while quicksum() builds the expression more efficiently. For models with thousands of variables, this can be the difference between seconds and minutes of model build time.

Why do I get "Model too large for size-limited license" with gurobipy?

The default license that comes with pip install gurobipy is limited to 2000 variables and 2000 constraints. This is fine for learning and small prototypes, but real optimization problems quickly exceed these limits. You'll need an academic license (free for universities) or a commercial license for larger problems.

How do I debug infeasible models when IIS computation fails?

Sometimes computeIIS() fails with "Cannot compute IIS on a feasible model" even when status is INFEASIBLE. This happens at the numerical boundary between feasible and infeasible. Try: (1) tightening the feasibility tolerance with the FeasibilityTol parameter, (2) using feasRelaxS() to find a minimal relaxation, or (3) manually removing constraints until you find feasibility, then adding them back one by one.
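
Option (3) can be made systematic with a deletion filter—the classic algorithm behind IIS computation: drop each constraint in turn, and make the drop permanent only if the rest stays infeasible. A pure-Python sketch using a toy feasibility check (interval constraints on a single variable; feasible and deletion_filter are illustrative names, not gurobipy API):

```python
def feasible(constraints):
    """Toy feasibility check: each constraint is an interval (lo, hi)
    that a single variable x must lie in; the set is feasible iff
    all the intervals overlap."""
    if not constraints:
        return True
    lo = max(c[0] for c in constraints)
    hi = min(c[1] for c in constraints)
    return lo <= hi


def deletion_filter(constraints):
    """Shrink an infeasible constraint set to a minimal infeasible
    core (the same idea an IIS captures)."""
    core = list(constraints)
    for c in list(core):
        trial = [d for d in core if d is not c]
        if not feasible(trial):
            core = trial  # c wasn't needed to cause the conflict
    return core


cons = [(0, 10), (5, 8), (20, 30), (25, 40)]
assert not feasible(cons)   # the full set is infeasible
core = deletion_filter(cons)
print(core)                 # [(5, 8), (25, 40)]: x <= 8 conflicts with x >= 25
```

In a real model, the "feasibility check" step would be a solve of the relaxed model, which is why this is slow by hand and why computeIIS() is worth fixing when it misbehaves.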

Should I create a new Gurobi environment for each optimization solve?

No. Environment creation is expensive—it involves license acquisition and initialization overhead. In production, create one environment and reuse it across multiple solves. Use the context manager pattern (with gp.Model(env=env)) to ensure models are properly disposed while keeping the environment alive.

When should I use callbacks versus just running optimization with parameters?

Use callbacks when you need dynamic decision-making during the solve: custom cut generation, problem-specific heuristics, or business logic that can't be expressed as standard constraints. For everything else—time limits, gap tolerances, threading—use parameters. Callbacks add complexity and can slow down the solver if not implemented carefully.


This is the reality of working with the Gurobi Python API in production. It's powerful, but it requires understanding the subtleties that tutorials skip. You'll spend time on environment management, license token handling, and debugging infeasible models that have nothing to do with the optimization problem you're trying to solve.

That infrastructure complexity is exactly what Ceris eliminates. Instead of managing Gurobi environments, license servers, and deployment pipelines, you send your model to an API and get results back. All the patterns in this tutorial still apply to your model-building code—you just don't need to worry about the infrastructure anymore.