Applied Intuition: Platform Architecture & Simulation Technology
Type: Company & Platform Deep Dive
Focus: Autonomous Vehicle Simulation, Neural Rendering, Synthetic Data
Audience: Engineers preparing to join Applied Intuition's Neural Sim / Synthetic Data team
Last Updated: March 2026
Table of Contents
- Executive Summary
- Company Background
- Core Product Suite
- Neural Sim Technical Architecture
- Synthetic Data Technical Architecture
- Sensor Simulation Engine
- Defense & Multi-Domain Expansion
- Competitive Landscape
- Team Structure & What to Expect
- Onboarding Preparation
- Interview Questions
- References
1. Executive Summary
+======================================================================+
| APPLIED INTUITION AT A GLANCE |
+======================================================================+
| |
| Founded: 2017, Mountain View, CA |
| Founders: Qasar Younis (CEO), Peter Ludwig (CTO) |
| Mission: Accelerate safe and intelligent machines |
| Valuation: ~$15 billion (Series E+, 2024-2025) |
| ARR: ~$415 million (estimated, 2025) |
| Employees: ~1,800+ (2025) |
| Customers: 18 of top 20 global OEMs |
| Industries: Auto, Trucking, Mining, Construction, Ag, Defense |
| |
| Core Offering: End-to-end simulation, validation, and autonomy |
| infrastructure for autonomous & intelligent machines |
| |
| Key Insight: "The picks-and-shovels" play for autonomy -- |
| sell the tools, not the self-driving car itself |
| |
+======================================================================+
Applied Intuition is the dominant infrastructure provider for autonomous vehicle development. While companies like Waymo build and operate their own self-driving fleets, Applied Intuition builds the simulation, validation, and data tools that nearly every major automaker uses to develop and test their own autonomy stacks. Think of them as the "AWS of autonomous driving" -- the platform layer that everyone builds on top of.
The company has achieved a rare combination in enterprise software: extremely high customer concentration among tier-1 OEMs (Toyota, Volkswagen Group, Porsche, Nissan, Isuzu, TRATON), rapid revenue growth, and deep technical moats built on custom rendering engines, neural reconstruction, and massive-scale cloud simulation.
Why This Matters
Traditional AV Testing:
- 1 test vehicle drives 100 miles/day
- $500K+ per vehicle per year
- 11 billion miles needed for statistical safety proof (RAND)
- At 100 miles/day = 301,369 YEARS per vehicle
Applied Intuition's Approach:
- 1,000+ concurrent simulations in the cloud
- Millions of miles simulated per day
- Scenarios can target rare edge cases specifically
- Cost: fraction of physical testing
- Time: days instead of decades
2. Company Background
2.1 Founding Story
Applied Intuition was founded in 2017 by Qasar Younis and Peter Ludwig, two engineers who recognized a critical gap in the autonomous vehicle industry.
Qasar Younis (CEO) previously served as COO of Y Combinator, where he saw hundreds of startups struggle with the same fundamental challenge: building reliable software for physical-world systems. Before YC, he worked at General Motors on early connected vehicle platforms, giving him direct insight into OEM pain points. Qasar's thesis was simple but powerful: the autonomous driving industry was building the same internal tools over and over again, poorly, and a dedicated infrastructure company could serve all of them better.
Peter Ludwig (CTO) came to the company from Google, where he worked on Android Auto and automotive software platforms. He saw that high-fidelity sensor simulation was the key bottleneck preventing faster AV development, and his insight was that the gaming industry's rendering technology (ray tracing, GPU compute) could be adapted to create physically accurate sensor models for autonomous systems.
Together, they identified that:
- Every AV company was building its own simulation stack from scratch
- These internal tools were typically 2-3 years behind what a dedicated team could build
- OEMs (not just robotaxis) would need simulation as ADAS evolved toward autonomy
- The market for "picks and shovels" was larger and more durable than any single AV play
2.2 Mission: Accelerate Safe and Intelligent Machines
Applied Intuition's mission extends beyond just autonomous cars. Their stated goal is to build the digital infrastructure enabling machines to "perceive, reason, and act in the real world." This deliberately broad framing covers:
- Passenger vehicles (L2++ through L4)
- Commercial trucking (highway autonomy)
- Mining and construction equipment (Komatsu partnership)
- Agricultural machinery
- Defense and military systems (ground, air, maritime)
- General robotics and embodied AI
2.3 Business Model: B2B SaaS for OEMs
+----------------------------------------------------------------------+
| BUSINESS MODEL ARCHITECTURE |
+----------------------------------------------------------------------+
| |
| Revenue Streams: |
| +------------------+ +------------------+ +------------------+ |
| | Platform License | | Professional | | Managed | |
| | (Recurring SaaS) | | Services | | Development | |
| | - Per-seat | | - Integration | | - SDS program | |
| | - Per-compute | | - Customization | | - Custom stacks | |
| | - Per-module | | - Training | | - Deployments | |
| +------------------+ +------------------+ +------------------+ |
| | | | |
| v v v |
| +--------------------------------------------------------------+ |
| | CUSTOMER SEGMENTS | |
| | Tier 1: Global OEMs (Toyota, VW, Porsche, Nissan...) | |
| | Tier 2: Tier-1 Suppliers (Valeo, Luminar, Continental) | |
| | Tier 3: AV Startups & Tech Companies | |
| | Tier 4: Defense & Government (Northrop, US Army) | |
| +--------------------------------------------------------------+ |
| |
+----------------------------------------------------------------------+
Key business model characteristics:
- Land-and-expand: Start with one simulation module, then upsell the full platform
- High switching costs: Once engineers build workflows around the toolchain, migration is extremely costly
- Multi-year contracts: Enterprise deals typically 3-5 year commitments
- Compute revenue: Cloud Engine usage scales with customer testing volume
- White-box collaboration: Customers retain full visibility into their own stacks, avoiding vendor lock-in fears while still creating deep platform dependency
2.4 Market Position
| Metric | Value | Context |
|---|---|---|
| OEM Penetration | 18/20 top global OEMs | Near-monopoly in Tier-1 automotive |
| Valuation | ~$15B | Makes it one of the most valuable private companies in autonomy |
| Est. ARR | ~$415M | Rapid growth from ~$100M in 2022 |
| Funding | $6.2B+ total raised | Series E at $6B valuation (2023), Series E extension (2024) |
| Revenue Growth | ~100% YoY (2022-2024) | Exceptional for enterprise software |
| Net Revenue Retention | >150% (estimated) | Strong expansion within existing accounts |
2.5 Key Partnerships
| Partner | Type | Details |
|---|---|---|
| Isuzu | OEM Customer | Trucking autonomy development |
| TRATON Group | OEM Customer | Scania, MAN, Navistar truck brands |
| Toyota | OEM Customer | ADAS and autonomy validation |
| Volkswagen Group | OEM Customer | Including Audi, Porsche brands |
| Valeo | Tier-1 Supplier | Sensor integration and validation |
| Luminar | LiDAR Partner | Sensor model integration |
| OpenAI | Technology Partner | AI/ML collaboration for generative scenarios |
| Northrop Grumman | Defense Partner | Beacon program, autonomous systems |
| Komatsu | Industrial Customer | Mining and construction autonomy |
| Microsoft Azure | Cloud Partner | Infrastructure for Cloud Engine |
2.6 Acquisitions
| Company | Year | Strategic Rationale |
|---|---|---|
| EpiSci | 2024 | Tactical AI for defense; autonomous decision-making for aircraft and ground vehicles; strengthened defense portfolio (Acuity product line) |
| Reblika Technologies | 2023 | 3D environment reconstruction and neural rendering; accelerated Neural Sim capabilities; team expertise in NeRF and Gaussian Splatting |
The EpiSci acquisition was particularly significant, bringing onboard AI systems that had been demonstrated in live DARPA programs for autonomous air combat. This gave Applied Intuition instant credibility and technical depth in the defense autonomous systems market.
3. Core Product Suite
Applied Intuition's product portfolio is organized into three pillars: Simulation Tools, Data Tools, and Autonomy Stacks.
+======================================================================+
| APPLIED INTUITION PRODUCT MAP |
+======================================================================+
| |
| SIMULATION TOOLS DATA TOOLS AUTONOMY STACKS |
| +-----------------+ +----------------+ +----------------+ |
| | Object Sim | | Data Explorer | | Self-Driving | |
| | Sensor Sim | | Synthetic | | System (SDS) | |
| | Neural Sim | | Datasets | | - Auto | |
| | Log Sim | | Validation | | - Truck | |
| | VehicleSim | | Toolset | | - Industrial | |
| | HIL Sim | | Map Toolset | | | |
| | Cloud Engine | | Copilot | | Vehicle OS | |
| +-----------------+ +----------------+ +----------------+ |
| |
| DEFENSE (Applied Intuition Defense) |
| +--------------------------------------------------------------+ |
| | Axion (mission-critical toolchain) | |
| | Acuity (onboard autonomy software) | |
| +--------------------------------------------------------------+ |
| |
+======================================================================+
3.1 Object Sim
Purpose: Scenario-based functional testing of the autonomy stack at the behavior/planning level, abstracting away sensor-level details.
Key Capabilities:
- ISO 26262 TCL 3 certified (Tool Confidence Level 3)
- Deterministic scenario execution
- OpenSCENARIO 1.x and 2.0 support
- Custom scenario scripting language
- Traffic agent behavior models (IDM, MOBIL, learned)
- Map-aware scenario generation
- Parameterized scenario sweeps (e.g., vary speed, distance, weather)
How It Works:
Object Sim Pipeline
===================
Scenario Definition Execution Engine Analysis
+------------------+ +------------------+ +------------------+
| - Initial state | ---> | - Physics engine |---> | - Pass/fail |
| - Actor paths | | - Traffic models | | - Metrics |
| - Trigger events | | - Map geometry | | - Coverage stats |
| - Success/fail | | - Object-level | | - Regression |
| criteria | | ground truth | | comparison |
+------------------+ +------------------+ +------------------+
|
No sensor rendering
(bounding boxes only)
ISO 26262 TCL 3 Certification means Object Sim has been independently verified to be suitable for use in safety-critical development workflows. This is a significant competitive differentiator -- most open-source simulators (CARLA, LGSVL) lack any ISO certification.
Typical Use Cases:
- Planning algorithm regression testing (did the new planner break existing scenarios?)
- Safety scenario coverage (Euro NCAP, SOTIF, NHTSA pre-crash typologies)
- Behavioral requirement verification (does the car yield at unprotected lefts?)
- Large-scale parameter sweeps (test 10,000 variations of a cut-in scenario)
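To make parameter sweeps concrete, here is a minimal sketch of how a cut-in sweep could be expressed in Python. The CutInScenario dataclass and the run_batch call are hypothetical illustrations, not Applied Intuition's actual API:

# Hypothetical sketch of a parameterized cut-in sweep (illustrative API).
import itertools
from dataclasses import dataclass

@dataclass
class CutInScenario:
    ego_speed_mph: float        # ego initial speed
    actor_speed_mph: float      # cut-in vehicle speed
    gap_m: float                # longitudinal gap at cut-in start
    road_friction: float        # 1.0 = dry, ~0.5 = wet

# Sweep the cross product of parameter values (10 x 10 x 10 x 2 = 2,000 runs)
sweep = [
    CutInScenario(e, a, g, f)
    for e, a, g, f in itertools.product(
        range(30, 80, 5),       # ego speed: 30-75 mph
        range(25, 75, 5),       # actor speed: 25-70 mph
        range(5, 55, 5),        # gap: 5-50 m
        (1.0, 0.5),             # dry vs. wet road
    )
]

# Each scenario runs deterministically and reports pass/fail against
# criteria such as minimum time-to-collision and required deceleration.
# results = object_sim.run_batch(sweep)   # hypothetical batch API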
3.2 Sensor Sim
Purpose: High-fidelity, physics-based sensor simulation for perception algorithm testing.
Key Capabilities:
- Custom Vulkan-based rendering engine (not Unity/Unreal)
- Real-time ray tracing for physically accurate light transport
- Multi-spectral rendering (visible, NIR, thermal/LWIR)
- Hardware-specific sensor models:
* Camera: lens distortion, rolling shutter, HDR, noise models
* LiDAR: beam patterns, intensity, multi-return, crosstalk
* Radar: RCS modeling, multi-path, Doppler, clutter
* Ultrasonic: near-field modeling, temperature effects
- Procedural and artist-built 3D asset libraries
- Weather simulation (rain, fog, snow, sun glare)
- Time-of-day with physically accurate sky models
Why Custom Vulkan (Not Unity/Unreal)?
| Aspect | Game Engine (Unity/Unreal) | Custom Vulkan Engine |
|---|---|---|
| Determinism | Non-deterministic rendering | Bit-exact deterministic |
| Sensor Physics | Approximated for visual appeal | Modeled for physical accuracy |
| Ray Tracing Control | Black-box pipeline | Full control over ray behavior |
| LiDAR Simulation | Point cloud from depth buffer (hack) | True time-of-flight ray tracing |
| Radar Simulation | Not supported natively | EM wave propagation models |
| Performance | General-purpose overhead | Optimized for sensor workloads |
| Rolling Shutter | Visual effect only | Physically accurate per-scanline timing |
| Multi-spectral | Visible light only | NIR, LWIR, multi-band |
Section 6 provides a much deeper technical dive into the sensor simulation engine.
3.3 Neural Sim
Purpose: Reconstruct real-world driving scenarios from log data using neural rendering, enabling photorealistic re-simulation with novel viewpoints and actor manipulations.
Key Capabilities:
- Gaussian Splatting for static background reconstruction
- PBR (Physically Based Rendering) for dynamic actor insertion
- Multi-sensor output: camera, LiDAR, radar (from one reconstruction)
- Automated ML pipeline: drive logs -> 3D scene in hours (not weeks)
- Novel viewpoint synthesis (shift ego position, change lane)
- Actor removal and insertion (add/remove vehicles, pedestrians)
- Temporal consistency across frames
- Daily use in SDS program for validation
Why Neural Sim Matters:
The fundamental challenge of AV simulation is the reality gap -- the difference between simulated sensor data and real-world sensor data. Neural Sim attacks this problem by reconstructing scenes from real data, inherently capturing real-world textures, lighting, and geometry that are extremely difficult to model procedurally.
Traditional Sensor Sim: Neural Sim:
3D Artist Models Real Drive Logs
| |
v v
Handcrafted Scene ML Reconstruction
| |
v v
Rendered Output Neural Rendered Output
| |
v v
"Looks realistic" "IS real (reconstructed)"
but has domain gap minimal domain gap
Section 4 provides the full technical deep dive into Neural Sim's architecture.
3.4 Log Sim
Purpose: Re-simulate recorded driving logs with modifications, enabling counterfactual analysis ("what would have happened if...").
Key Capabilities:
- Replay real sensor data with modified ego behavior
- Extract scenarios from fleet logs automatically
- Inject perturbations (actor speed changes, trajectory modifications)
- Compare original vs. modified outcomes
- Integration with Data Explorer for log selection
- Supports both open-loop and closed-loop replay
Open-Loop vs. Closed-Loop Log Sim:
Open-Loop Log Sim:
- Replay exact recorded sensor data
- Feed to perception/planning stack
- Compare planned trajectory to actual driven trajectory
- Pro: Perfect sensor fidelity (it's real data!)
- Con: Other actors don't react to ego changes
Closed-Loop Log Sim (requires Neural Sim or Sensor Sim):
- Modify ego behavior
- Re-render sensor data from new viewpoint
- Other actors can react (via behavior models)
- Pro: Tests true closed-loop behavior
- Con: Requires high-quality re-rendering
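As a concrete illustration of the open-loop comparison step, the sketch below scores divergence between the planned and actually driven trajectories. All names and the review threshold are hypothetical; real metrics would be richer:

# Hypothetical sketch: open-loop log replay evaluation.
# Compare the trajectory the planner *would* drive against the
# trajectory the vehicle *actually* drove in the recorded log.
import numpy as np

def trajectory_divergence(planned_xy: np.ndarray, driven_xy: np.ndarray) -> dict:
    """Both inputs are (T, 2) arrays sampled at the same timestamps."""
    errors = np.linalg.norm(planned_xy - driven_xy, axis=1)  # per-step L2 error
    return {
        "mean_error_m": float(errors.mean()),
        "max_error_m": float(errors.max()),
        # Flag for review if the new planner diverges noticeably
        # from the previously driven trajectory.
        "needs_review": bool(errors.max() > 1.5),   # illustrative threshold
    }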
3.5 Synthetic Datasets
Purpose: Generate automatically labeled training data for perception models at scale, reducing dependence on expensive human annotation of real-world data.
Key Capabilities:
- Automatic ground-truth labels (zero human annotation cost):
* 2D bounding boxes
* 3D bounding boxes
* Instance segmentation
* Semantic segmentation
* Panoptic segmentation
* Depth maps
* Optical flow
* Surface normals
* Lane markings and boundaries
- Configurable scene generation (weather, time, traffic density)
- Distribution-based generation (match real-world statistics)
- Domain gap mitigation (FDA, CyCADA, style transfer)
- Format compatibility (KITTI, nuScenes, COCO, custom)
Section 5 provides the full technical deep dive into the Synthetic Data pipeline.
3.6 Cloud Engine
Purpose: Massively parallel simulation orchestration in the cloud, enabling thousands of concurrent test runs.
Key Capabilities:
- Parallel execution of 1,000+ concurrent simulations
- Auto-scaling compute allocation
- Job scheduling and prioritization
- Result aggregation and reporting
- CI/CD integration (trigger sims on code commits)
- Multi-cloud support (AWS, Azure, GCP)
- Cost optimization (spot instances, preemptible VMs)
- Deterministic execution guarantees
Architecture:
+----------------------------------------------------------------------+
| CLOUD ENGINE |
+----------------------------------------------------------------------+
| |
| +------------------+ +-------------------+ |
| | Job Scheduler | --> | Resource Allocator | |
| | - Priority queue | | - GPU pools | |
| | - Dependencies | | - CPU pools | |
| | - Retry logic | | - Spot/preemptible | |
| +------------------+ +-------------------+ |
| | | |
| v v |
| +--------------------------------------------------+ |
| | Execution Cluster | |
| | +--------+ +--------+ +--------+ +--------+ | |
| | | Sim #1 | | Sim #2 | | Sim #3 | | ... | | |
| | | GPU | | GPU | | CPU | | | | |
| | +--------+ +--------+ +--------+ +--------+ | |
| +--------------------------------------------------+ |
| | |
| v |
| +--------------------------------------------------+ |
| | Result Aggregation & Storage | |
| | - Metrics DB - Artifact storage | |
| | - Dashboards - Regression alerts | |
| +--------------------------------------------------+ |
| |
+----------------------------------------------------------------------+
Scale Numbers:
- Customers routinely run 10,000+ scenarios per nightly regression
- SDS program runs millions of simulated miles per week
- Sub-second scheduling latency for new jobs
- Results available within minutes of batch completion
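From a customer's perspective, a CI-triggered batch might look roughly like the following sketch. The client library, method names, and parameters are illustrative assumptions, not the real Cloud Engine SDK:

# Hypothetical sketch of submitting a nightly regression batch.
# `cloud_engine_client` is an illustrative stand-in, not a real SDK.
from cloud_engine_client import Client  # hypothetical module

client = Client(project="adas-planner")

batch = client.submit_batch(
    scenarios="suites/nightly_regression/*.scenario",  # ~10,000 scenarios
    stack_image="registry/planner:commit-abc123",      # built by CI on merge
    compute={"gpu_pool": "spot", "max_parallel": 1000},
    deterministic=True,                                # bit-exact replays
)

# Block until done, then gate the merge on regression results.
report = batch.wait()
failed = [r for r in report.results if not r.passed]
print(f"{len(failed)} regressions out of {len(report.results)} scenarios")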
3.7 Data Explorer
Purpose: Fleet data management platform for ingesting, indexing, searching, and analyzing petabytes of driving data.
Key Capabilities:
- Ingest data from heterogeneous fleet sensors
- Automatic metadata extraction and indexing
- Natural language and structured queries
- Anomaly detection (identify interesting/rare events)
- Scene understanding and classification
- Integration with Log Sim for scenario extraction
- Handles hundreds of petabytes of data
- Map-based spatial queries ("find all logs near intersection X")
Why It Matters: The most valuable asset an AV company has is its driving data. But raw data is useless without the ability to find specific scenarios, identify edge cases, and extract training examples. Data Explorer turns raw fleet data into a searchable, queryable knowledge base.
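As a sketch of what a structured, map-aware query could look like (the client module and query syntax here are illustrative assumptions, not the product's actual interface):

# Hypothetical sketch of a structured + spatial fleet-data query.
from data_explorer_client import DataExplorer  # illustrative stand-in

dx = DataExplorer(fleet="trucking-na")

# "Find all logs near intersection X with rain and a pedestrian present"
logs = dx.query(
    spatial={"near": (37.3861, -122.0839), "radius_m": 100},  # example coords
    filters={"weather": "rain", "detected_classes": ["pedestrian"]},
    time_range=("2025-01-01", "2025-06-30"),
    limit=500,
)

# Hand the hits to Log Sim for scenario extraction
# scenarios = log_sim.extract_scenarios(logs)   # hypothetical integration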
3.8 Validation Toolset
Purpose: Verification and validation (V&V) management for regulatory compliance and safety assurance.
Key Capabilities:
- Requirements management and traceability
- Test case management linked to requirements
- Coverage tracking (which requirements are tested?)
- ISO 26262 / SOTIF / UL 4600 compliance workflows
- Automated report generation for regulatory submissions
- Integration with Object Sim and Cloud Engine
- Audit trail and version control
Regulatory Landscape:
+----------------------------------------------------------------------+
| KEY SAFETY STANDARDS |
+----------------------------------------------------------------------+
| |
| ISO 26262 Functional Safety (hardware + software) |
| ISO 21448 SOTIF (Safety of the Intended Functionality) |
| UL 4600 Safety for Autonomous Products |
| ISO 34502 Scenario-based testing methodology |
| UN R157 Automated Lane Keeping Systems (ALKS) |
| Euro NCAP Consumer safety rating (increasingly requires ADAS) |
| NHTSA ADS US federal framework for autonomous driving systems |
| |
+----------------------------------------------------------------------+
3.9 Map Toolset
Purpose: Create, edit, and maintain HD maps for simulation and deployment.
Key Capabilities:
- Generative AI for automated map creation from satellite/aerial imagery
- Manual editing tools for fine-tuning
- OpenDRIVE and Lanelet2 format support
- Automatic lane graph extraction
- Traffic sign and signal placement
- Integration with Object Sim and Sensor Sim
- Cloud-native HD map management (announced Jan 2026)
- Real-time map updates from fleet data
The January 2026 blog post "Building for Scale: Cloud-Native HD Maps" described their approach to making HD maps a cloud-native service rather than static files, enabling continuous updates and versioning at scale.
3.10 Copilot
Purpose: Natural language interface for scenario generation and simulation control, dramatically reducing the time to create test scenarios.
Key Capabilities:
- Natural language to scenario conversion
- "Create a scenario where a pedestrian jaywalks from behind a parked van
while it's raining at night on a 4-lane road with a 45 mph speed limit"
- 40x faster scenario creation vs. manual scripting
- Iterative refinement through conversation
- Automatic parameterization suggestions
- Integration with Object Sim and Sensor Sim
- Bulk scenario generation from natural language descriptions
Example Interaction:
User: "Generate a scenario where a cyclist runs a red light at an
intersection while the ego vehicle has a green light and is
turning right."
Copilot: Generating scenario with parameters:
- Intersection type: signalized, 4-way
- Ego: right turn, green light, 15 mph
- Cyclist: cross-traffic, running red, 12 mph
- Conflict point: ego's turning path
Suggested parameter sweeps:
- Cyclist speed: [8, 10, 12, 15, 18] mph
- Ego speed: [10, 15, 20, 25] mph
- Cyclist entry time offset: [-2s, -1s, 0s, +1s, +2s]
Total scenarios: 100 (5 x 4 x 5 grid)
[Generate] [Modify] [Add complexity]
3.11 Self-Driving System (SDS)
Purpose: Applied Intuition's own full-stack autonomy system, offered as a turnkey solution for OEMs who want autonomy capabilities without building from scratch.
Key Capabilities:
- L2++ to L4 autonomy levels
- Modular architecture (perception, prediction, planning, control)
- Variants for automotive, trucking, and industrial
- Hardware-agnostic (supports multiple compute platforms)
- Sensor-agnostic (camera-first, with LiDAR/radar fusion options)
- OTA-updatable via Vehicle OS
- Validated using Applied Intuition's own simulation tools
- European launch announced January 2026
Strategic Significance: SDS represents Applied Intuition's evolution from pure tools provider to autonomy stack provider. This is both a growth opportunity and a potential tension point -- they now compete with some of their tools customers. However, the SDS program also serves as the most demanding internal customer for their simulation tools, driving rapid improvement of the entire platform.
SDS Architecture (Simplified):
Sensors -----> Perception -----> Prediction -----> Planning -----> Control
(Camera, (Detection, (Trajectory (Route, (Steering,
LiDAR, Tracking, Forecasting, Behavior, Throttle,
Radar) Classification) Intent Pred.) Motion Plan) Braking)
| | |
v v v
Validated daily using Neural Sim + Object Sim + Cloud Engine
4. Neural Sim Technical Architecture
Neural Sim is arguably Applied Intuition's most technically sophisticated product and a key differentiator. It sits at the intersection of computer vision, neural rendering, and autonomous driving simulation.
4.1 The Core Problem
Traditional simulation faces a fundamental dilemma:
+----------------------------------------------------------------------+
| THE SIMULATION DILEMMA |
+----------------------------------------------------------------------+
| |
| High Fidelity (Sensor Sim) High Realism (Log Replay) |
| + Fully controllable + Real sensor data |
| + Any scenario possible + No domain gap |
| + Ground truth labels + Captures real-world complexity |
| - Domain gap to reality - Can't modify scenes |
| - Expensive asset creation - Limited to recorded scenarios |
| - Hard to match real sensors - No ground truth labels |
| |
| Neural Sim: Best of Both |
| + Real-world appearance |
| + Modifiable (novel views, actor changes) |
| + Multi-sensor (camera + LiDAR + radar) |
| + Automated reconstruction |
| |
+----------------------------------------------------------------------+
4.2 Scene Reconstruction Pipeline
The Neural Sim pipeline transforms raw drive logs into fully manipulable 3D scenes:
Drive Log Input Scene Reconstruction Simulation Output
================= ====================== =================
+---------------+ +--------------------+ +----------------+
| Camera frames | | 1. Static Scene | | Novel viewpoint|
| (6-12 cameras)| ----+ | Reconstruction | ----+ | camera images |
+---------------+ | | (Gaussian Splat) | | +----------------+
| +--------------------+ |
+---------------+ | | +----------------+
| LiDAR scans | ----+-----> +--------------------+ -------> | | Synthetic |
| (1-5 LiDARs) | | | 2. Dynamic Actor | | | | LiDAR points |
+---------------+ | | Reconstruction | | | +----------------+
| | (PBR meshes) | | |
+---------------+ | +--------------------+ | | +----------------+
| GPS/IMU poses | ----+ | +-> | Radar returns |
+---------------+ | +--------------------+ | +----------------+
| | 3. Scene Graph | |
+---------------+ | | Assembly | -------+ +----------------+
| Radar returns | ----+ | (actors + BG) | +-----> | Modified |
+---------------+ +--------------------+ | scenarios |
+----------------+
Step 1: Static Background Reconstruction (Gaussian Splatting)
The static elements of the scene (road surface, buildings, vegetation, sky) are reconstructed using 3D Gaussian Splatting (3DGS):
# Conceptual pipeline for static scene reconstruction.
# (GaussianCloud, the detection/inpainting helpers, and the
# differentiable renderer are assumed components of the pipeline.)
import random

import torch
import torch.nn.functional as F

class StaticSceneReconstructor:
    """
    Reconstructs the static background using 3D Gaussian Splatting.

    Input:  multi-view camera images + LiDAR point clouds + poses
    Output: set of 3D Gaussians representing the static scene
    """

    def __init__(self, config):
        self.num_gaussians = config.initial_gaussians  # ~500K-2M
        self.sh_degree = config.sh_degree              # spherical harmonics degree (3)
        self.learning_rate = config.lr                 # adaptive per-parameter

    def preprocess(self, drive_log):
        """Remove dynamic objects, align frames, estimate poses."""
        # 1. Run object detection to identify dynamic actors
        dynamic_masks = self.detect_dynamic_objects(drive_log.images)
        # 2. Inpaint masked regions (remove cars, pedestrians, etc.)
        static_images = self.inpaint(drive_log.images, dynamic_masks)
        # 3. Filter LiDAR points belonging to dynamic objects
        static_lidar = self.filter_dynamic_points(drive_log.lidar, dynamic_masks)
        # 4. Refine camera poses using structure-from-motion
        refined_poses = self.refine_poses(static_images, drive_log.poses)
        return static_images, static_lidar, refined_poses

    def initialize_gaussians(self, static_lidar):
        """Initialize Gaussians from the LiDAR point cloud."""
        # Each Gaussian has:
        #   - position (x, y, z):                        3 params
        #   - covariance (scale + rotation quaternion):  7 params
        #   - opacity:                                   1 param
        #   - color (spherical harmonics):
        #       (sh_degree + 1)^2 * 3 = 48 params for degree 3
        # Total: ~59 parameters per Gaussian
        N = static_lidar.points.shape[0]
        rotations = torch.zeros(N, 4)
        rotations[:, 0] = 1.0  # identity quaternions (w, x, y, z)
        gaussians = GaussianCloud(
            positions=static_lidar.points,    # initialize at LiDAR points
            scales=torch.ones(N, 3) * 0.01,   # small initial scale
            rotations=rotations,
            opacities=torch.ones(N) * 0.5,    # half-transparent initially
            sh_coeffs=torch.zeros(N, 48),     # colors fit during optimization
        )
        return gaussians

    def optimize(self, gaussians, images, poses, num_iterations=30000):
        """Optimize Gaussians to minimize photometric loss."""
        for iteration in range(num_iterations):
            # 1. Randomly sample a training view
            view_idx = random.randrange(len(images))
            target_image = images[view_idx]
            camera_pose = poses[view_idx]
            # 2. Render Gaussians from this viewpoint (differentiable)
            rendered_image = self.differentiable_render(gaussians, camera_pose)
            # 3. Compute loss (L1 + D-SSIM; ssim is a differentiable helper)
            loss = 0.8 * F.l1_loss(rendered_image, target_image) + \
                   0.2 * (1.0 - ssim(rendered_image, target_image))
            # 4. Backpropagate and update Gaussian parameters
            self.optimizer.zero_grad()
            loss.backward()
            self.optimizer.step()
            # 5. Adaptive density control (split/clone/prune)
            if iteration % 100 == 0:
                self.densify_and_prune(gaussians)
        return gaussians
Why Gaussian Splatting over NeRF?
| Property | NeRF | 3D Gaussian Splatting |
|---|---|---|
| Rendering Speed | ~1-5 FPS (ray marching) | 100-200+ FPS (rasterization) |
| Training Time | Hours to days | Minutes to hours |
| Explicit Geometry | Implicit (MLP weights) | Explicit (point cloud) |
| Editing | Very difficult | Straightforward (move/delete Gaussians) |
| LiDAR Simulation | Requires expensive ray casting | Direct point sampling |
| Memory | Compact (MLP weights) | Larger (millions of Gaussians) |
| Quality | Excellent | Excellent (comparable or better) |
For simulation, the advantages of Gaussian Splatting are decisive:
- Real-time rendering is essential for closed-loop simulation
- Explicit geometry enables LiDAR simulation (cast rays against Gaussians)
- Editability allows removing/adding actors
- Fast training enables automated pipeline processing
Step 2: Dynamic Actor Reconstruction (PBR)
Dynamic objects (vehicles, pedestrians, cyclists) are handled differently from the static background:
Why PBR for Dynamic Actors (instead of Gaussians)?
===================================================
1. CONTROLLABILITY: PBR meshes can be rigged and animated
- Change vehicle speed, trajectory, pose
- Animate pedestrian walking/running
- Articulate cyclist pedaling
2. RELIGHTING: PBR materials respond correctly to lighting changes
- Time-of-day changes
- Weather changes
- Headlight/taillight interactions
3. COMPOSABILITY: PBR actors can be inserted into ANY scene
- Not tied to a specific reconstruction
- Asset library can be reused across scenarios
4. GROUND TRUTH: PBR meshes provide perfect labels
- Exact 3D bounding boxes
- Pixel-perfect segmentation
- Accurate depth and surface normals
Pipeline:
Drive Log --> Detect Actors --> Track Across Frames -->
--> Estimate 3D Shape (mesh reconstruction) -->
--> Estimate Material Properties (diffuse, specular, roughness) -->
--> Fit to PBR Material Model -->
--> Rigged, Animatable Actor Asset
Step 3: Multi-Sensor Rendering
A key innovation in Neural Sim is generating consistent outputs across multiple sensor modalities from a single scene reconstruction:
Unified 3D Scene
(Gaussians + PBR Actors)
|
+------------+------------+
| | |
v v v
+---------+ +---------+ +---------+
| Camera | | LiDAR | | Radar |
| Render | | Render | | Render |
+---------+ +---------+ +---------+
| | |
v v v
RGB Image Point Cloud Radar Returns
(splatting) (ray-Gaussian (RCS from
intersection) material props)
Camera Rendering:
- Differentiable Gaussian splatting
- Per-camera lens model (distortion, vignetting)
- Rolling shutter simulation
- Exposure and gain modeling
LiDAR Rendering:
- Ray casting against Gaussian ellipsoids (see the sketch below)
- Per-beam firing pattern (Velodyne, Ouster, Hesai, etc.)
- Intensity from material reflectance
- Range noise modeling
- Multi-return simulation
Radar Rendering:
- RCS (Radar Cross Section) from geometry + material
- Range-Doppler processing
- Multi-path reflections (via ray tracing)
- Clutter and noise modeling
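To make ray-Gaussian intersection concrete: each Gaussian's covariance defines an ellipsoid, and a LiDAR ray can be tested against it by transforming the ray into the ellipsoid's local frame and solving a quadratic. A minimal numpy sketch, under the simplifying assumption that we intersect a hard iso-density surface (real renderers accumulate opacity along the ray):

# Minimal sketch: intersect a LiDAR ray with one Gaussian's ellipsoid.
import numpy as np

def ray_ellipsoid_hit(origin, direction, center, R, scales):
    """
    origin, direction: ray in world frame (direction normalized)
    center: Gaussian mean; R: 3x3 rotation; scales: per-axis extents
    Returns range to hit in meters, or None if the ray misses.
    """
    # Transform the ray into the unit-sphere frame of the ellipsoid
    o = (R.T @ (origin - center)) / scales
    d = (R.T @ direction) / scales
    # Solve |o + t*d|^2 = 1  (quadratic in t)
    a, b, c = d @ d, 2.0 * (o @ d), o @ o - 1.0
    disc = b * b - 4 * a * c
    if disc < 0:
        return None                       # ray misses this Gaussian
    t = (-b - np.sqrt(disc)) / (2 * a)    # nearest intersection
    return t if t > 0 else None

In practice each beam is tested against many Gaussians through an acceleration structure, keeping the nearest valid hit; intensity then follows from material reflectance at that point.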
4.3 Automated ML Pipeline
One of Neural Sim's key differentiators is automation. While academic neural rendering requires manual tuning per scene, Applied Intuition has built an automated pipeline:
+----------------------------------------------------------------------+
| NEURAL SIM AUTOMATED PIPELINE |
+----------------------------------------------------------------------+
| |
| Fleet Data Auto-Processing Quality Assurance |
| ========= =============== ================== |
| |
| Drive logs --> 1. Pose estimation --> Automated QA metrics: |
| uploaded 2. Object detection - PSNR > threshold |
| nightly 3. Dynamic masking - SSIM > threshold |
| 4. LiDAR filtering - LPIPS < threshold |
| 5. Gaussian fitting - Temporal consistency |
| 6. Actor reconstruction - LiDAR point accuracy |
| 7. Scene assembly - Semantic correctness |
| 8. Multi-sensor render |
| |
| Timeline: Log upload -> Reconstructed scene: ~4-8 hours |
| (fully automated, no human intervention) |
| |
+----------------------------------------------------------------------+
4.4 Validation Metrics
Neural Sim quality is measured on both upstream and downstream metrics:
Upstream Metrics (rendering quality):
| Metric | What It Measures | Target |
|---|---|---|
| PSNR | Pixel-level reconstruction accuracy | > 28 dB |
| SSIM | Structural similarity to ground truth | > 0.90 |
| LPIPS | Perceptual similarity (learned metric) | < 0.15 |
| FID | Distribution-level realism | < 30 |
| LiDAR Chamfer Distance | Point cloud accuracy | < 0.1m |
| Temporal Consistency | Flickering/jitter between frames | Low variance |
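The upstream metrics are cheap enough to compute as automated QA gates on every reconstruction. A minimal PSNR gate in numpy (the 28 dB threshold mirrors the table above):

# Minimal sketch: PSNR gate for a reconstructed frame.
import numpy as np

def psnr(rendered: np.ndarray, real: np.ndarray, max_val: float = 1.0) -> float:
    """Peak signal-to-noise ratio in dB; images scaled to [0, max_val]."""
    mse = np.mean((rendered.astype(np.float64) - real.astype(np.float64)) ** 2)
    if mse == 0:
        return float("inf")    # identical images
    return 10.0 * np.log10(max_val ** 2 / mse)

def passes_upstream_qa(rendered, real, threshold_db=28.0):
    return psnr(rendered, real) > threshold_db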
Downstream Metrics (impact on AV stack performance):
| Metric | What It Measures |
|---|---|
| Detection mAP delta | Does detection accuracy change when tested on Neural Sim vs. real data? |
| Tracking MOTA delta | Does tracking performance degrade in Neural Sim? |
| Planning decision agreement | Does the planner make the same decisions in Neural Sim vs. real logs? |
| False intervention rate | Does Neural Sim produce realistic disengagement patterns? |
The downstream metrics are what truly matter. A reconstruction can look perfect on pixel metrics (high PSNR) yet still cause the perception stack to behave in unrealistic ways. Applied Intuition therefore validates that Neural Sim outputs produce statistically equivalent AV stack behavior compared to real-world data.
4.5 Daily Validation in SDS Program
Neural Sim is used daily in Applied Intuition's own SDS (Self-Driving System) program:
Daily SDS Validation Loop:
1. Engineers commit code changes to perception/planning
2. CI/CD triggers Cloud Engine jobs
3. Neural Sim reconstructs N scenes from recent fleet logs
4. Modified perception/planning runs on Neural Sim scenes
5. Metrics compared to baseline (previous release)
6. Regressions flagged automatically
7. Engineers investigate flagged scenarios
8. Fix, re-test, merge
Scale: Thousands of Neural Sim scenarios per day
Latency: Results available within 1-2 hours of code commit
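The regression-flagging step (step 6 above) reduces to comparing per-scenario metrics against the baseline release; a minimal sketch, assuming higher-is-better metrics such as mAP:

# Minimal sketch of automated regression flagging (step 6).
def flag_regressions(baseline: dict, current: dict, tol: float = 0.01) -> dict:
    """Both dicts map scenario_id -> metric value (higher is better).
    Returns scenarios whose metric dropped by more than `tol`."""
    return {
        sid: (baseline[sid], current[sid])
        for sid in baseline
        if sid in current and current[sid] < baseline[sid] - tol
    }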
4.6 Research Frontier: S2GO (ICLR 2026)
Applied Intuition published "S2GO: Streaming Sparse Gaussian Occupancy" at ICLR 2026, advancing the state of the art in 3D scene understanding for autonomous systems. This research directly feeds back into Neural Sim capabilities:
- Sparse Gaussian Occupancy: More efficient scene representation than dense voxels
- Streaming: Processes data incrementally (important for real-time applications)
- Occupancy Prediction: Predicts which parts of 3D space are occupied -- critical for planning and safety
5. Synthetic Data Technical Architecture
Synthetic data generation is one of Applied Intuition's fastest-growing product areas, addressing the enormous cost of human-annotated training data for perception models.
5.1 The Data Problem in AV Development
The Economics of Real-World Annotation:
=======================================
1 hour of driving data:
- ~216,000 camera frames (6 cameras x 10 Hz x 3,600 s)
- Human annotation cost: $0.50 - $5.00 per frame
- Total: $108K - $1.08M per hour of driving
- Quality: inconsistent, slow turnaround (weeks)
Synthetic data from Applied Intuition:
- Unlimited frames from simulation
- Annotation: FREE (automatic ground truth)
- Quality: pixel-perfect, consistent
- Turnaround: hours
- Challenge: domain gap (synthetic != real)
5.2 Generation Pipeline
The synthetic data generation pipeline has four stages:
+----------------------------------------------------------------------+
| SYNTHETIC DATA GENERATION PIPELINE |
+----------------------------------------------------------------------+
| |
| Stage 1: DEFINITION Stage 2: COMPILATION Stage 3: SIMULATION |
| ================== ================== ================== |
| |
| "What to generate" "Build the scene" "Render frames" |
| |
| - Scenario spec - Asset placement - Sensor rendering |
| - Distribution params - Material assignment - Physics sim |
| - Label requirements - Lighting setup - Multi-frame |
| - Camera config - Weather config - All sensor types |
| |
| Stage 4: LABELING |
| ================== |
| |
| "Extract annotations" |
| |
| - 2D/3D bounding boxes (automatic from scene graph) |
| - Segmentation masks (automatic from object IDs) |
| - Depth maps (automatic from z-buffer) |
| - Optical flow (automatic from motion vectors) |
| - Lane markings (automatic from map data) |
| - Occlusion flags (automatic from visibility) |
| |
+----------------------------------------------------------------------+
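Because the simulator knows every object's exact 3D pose, labels fall out of plain geometry. For example, a 2D bounding box is just the image-space extent of the projected 3D box corners. A minimal pinhole-camera sketch (intrinsics and the camera-frame corners are assumed given):

# Minimal sketch: derive a 2D bounding box from a known 3D box.
import numpy as np

def project_3d_box(corners_cam: np.ndarray, K: np.ndarray):
    """
    corners_cam: (8, 3) box corners in the camera frame (z forward, meters)
    K: (3, 3) pinhole intrinsics
    Returns (x_min, y_min, x_max, y_max) in pixels, or None if behind camera.
    """
    if np.any(corners_cam[:, 2] <= 0):
        return None                       # box (partly) behind the camera
    uvw = (K @ corners_cam.T).T           # project all 8 corners: (8, 3)
    uv = uvw[:, :2] / uvw[:, 2:3]         # perspective divide
    x_min, y_min = uv.min(axis=0)
    x_max, y_max = uv.max(axis=0)
    return float(x_min), float(y_min), float(x_max), float(y_max)

Segmentation masks come the same way from per-object IDs in the renderer, and depth maps directly from the z-buffer.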
5.3 Scene Generation Approaches
Applied Intuition supports multiple approaches to defining what synthetic scenes to generate:
Approach 1: Natural Language (via Copilot)
Input: "Urban intersection at dusk with heavy rain, 3 vehicles
waiting at a red light, 1 pedestrian crossing, and a
cyclist approaching from the left."
Output: Fully specified scene with all parameters resolved:
- Map: urban_intersection_4way_v3
- Time: 18:45 (civil twilight)
- Weather: rain_heavy (droplet density: 0.8, ground wetness: 0.9)
- Vehicles: [sedan_blue @ lane_2_pos_5, suv_white @ lane_2_pos_15, ...]
- Pedestrians: [adult_male_v2 @ crosswalk_north, speed: 1.2 m/s]
- Cyclist: [road_cyclist_v1 @ approach_west, speed: 5.0 m/s]
Approach 2: Distribution-Based
# Specify statistical distributions to match real-world data
# (Categorical / Uniform / Poisson and `generator` below are illustrative
#  helpers, not a specific public API)
scene_distribution = {
"location_type": Categorical({"urban": 0.4, "suburban": 0.35, "highway": 0.25}),
"time_of_day": Uniform(6, 22), # 6 AM to 10 PM
"weather": Categorical({
"clear": 0.60, "cloudy": 0.20, "rain": 0.12,
"fog": 0.05, "snow": 0.03
}),
"num_vehicles": Poisson(lam=8),
"num_pedestrians": Poisson(lam=3),
"vehicle_types": Categorical({
"sedan": 0.40, "suv": 0.30, "truck": 0.15,
"van": 0.10, "motorcycle": 0.05
}),
}
# Generate 100,000 scenes matching this distribution
dataset = generator.sample(scene_distribution, n=100000)
Approach 3: Log Extraction
Real Drive Log --> Scene Understanding --> Synthetic Reproduction
- Object detection
- Map matching Modify:
- Weather estimation - Swap actor types
- Light estimation - Change weather
- Add rare objects
- Vary density
Approach 4: Procedural Generation
Rules-based scene construction:
1. Select map tile
2. Place ego vehicle on valid lane
3. Populate traffic using rule-based placement
- Respect lane geometry
- Enforce minimum following distances
- Place parked vehicles in valid spots
4. Add environmental detail
- Street furniture (signs, lights, hydrants)
- Vegetation
- Road markings
5. Set environmental conditions
- Sun position from time + GPS coordinates
- Weather particle systems
- Ground material wetness
5.4 Supported Labels
+----------------------------------------------------------------------+
| SYNTHETIC DATA LABEL TYPES |
+----------------------------------------------------------------------+
| |
| 2D Labels: 3D Labels: Dense Labels: |
| +------------------+ +------------------+ +------------------+ |
| | Bounding boxes | | 3D bounding boxes| | Depth maps | |
| | (x, y, w, h) | | (x,y,z,l,w,h, | | (per-pixel) | |
| | | | yaw, class) | | | |
| | Keypoints | | 3D keypoints | | Optical flow | |
| | (skeleton joints) | | | | (u, v per pixel) | |
| +------------------+ +------------------+ | | |
| | Surface normals | |
| Segmentation: Temporal: | (nx, ny, nz) | |
| +------------------+ +------------------+ +------------------+ |
| | Semantic seg | | Object tracking | |
| | (per-pixel class)| | (consistent IDs) | Map Labels: |
| | | | | +------------------+ |
| | Instance seg | | Activity labels | | Lane boundaries | |
| | (per-pixel ID) | | (turning, parked)| | Lane types | |
| | | +------------------+ | Road edges | |
| | Panoptic seg | | Crosswalks | |
| | (semantic + inst)| | Stop lines | |
| +------------------+ +------------------+ |
| |
+----------------------------------------------------------------------+
5.5 Domain Gap Mitigation
The biggest challenge with synthetic data is the domain gap -- the statistical difference between synthetic and real-world images that causes models trained on synthetic data to perform poorly on real data.
+----------------------------------------------------------------------+
| DOMAIN GAP SOURCES |
+----------------------------------------------------------------------+
| |
| Appearance Gap: Geometry Gap: |
| - Texture quality - Asset shape accuracy |
| - Material realism - Scene layout plausibility |
| - Lighting accuracy - Object placement naturalism |
| - Shadow quality - Vegetation geometry |
| |
| Content Gap: Sensor Gap: |
| - Object variety - Noise characteristics |
| - Scene diversity - Lens artifacts |
| - Behavioral realism - Compression artifacts |
| - Weather authenticity - Sensor-specific quirks |
| |
+----------------------------------------------------------------------+
Mitigation Strategy 1: Fourier Domain Adaptation (FDA)
# FDA bridges the appearance gap by swapping low-frequency
# Fourier components between synthetic and real images
import torch

def create_low_freq_mask(h, w, beta):
    """Boolean mask over the centered (fftshifted) spectrum selecting
    the low-frequency band; beta sets its half-width as a fraction
    of the image size."""
    mask = torch.zeros(h, w, dtype=torch.bool)
    b = max(1, int(min(h, w) * beta))
    cy, cx = h // 2, w // 2
    mask[cy - b:cy + b + 1, cx - b:cx + b + 1] = True
    return mask

def fourier_domain_adaptation(synthetic_img, real_img, beta=0.01):
    """
    Swap the low-frequency spectrum of a synthetic image with that of
    a real image. This transfers the "style" (color distribution,
    overall lighting) while preserving "content" (object shapes, layout).

    Args:
        synthetic_img: generated synthetic image, (C, H, W) tensor
        real_img: real-world reference image, same shape
        beta: fraction of the spectrum to swap (0.01 = 1%)
    """
    # 1. FFT both images, shifting low frequencies to the center
    syn_fft = torch.fft.fftshift(torch.fft.fft2(synthetic_img), dim=(-2, -1))
    real_fft = torch.fft.fftshift(torch.fft.fft2(real_img), dim=(-2, -1))

    # 2. Create the centered low-frequency mask
    h, w = synthetic_img.shape[-2:]
    mask = create_low_freq_mask(h, w, beta)

    # 3. Swap low-frequency components
    adapted_fft = syn_fft.clone()
    adapted_fft[..., mask] = real_fft[..., mask]

    # 4. Inverse shift and inverse FFT back to image space
    adapted_fft = torch.fft.ifftshift(adapted_fft, dim=(-2, -1))
    return torch.fft.ifft2(adapted_fft).real
Mitigation Strategy 2: CyCADA (Cycle-Consistent Adversarial Domain Adaptation)
+----------------------------------------------------------------------+
| CyCADA ARCHITECTURE |
+----------------------------------------------------------------------+
| |
| Synthetic Image -----> Generator_S2R -----> Adapted Image |
| | | |
| | Cycle Consistency | |
| +<---------- Generator_R2S <-----------------+ |
| |
| Real Image -----> Discriminator -----> Real or Adapted? |
| (learns to tell |
| real from adapted) |
| |
| Key Loss Terms: |
| 1. Adversarial loss: adapted images should fool discriminator |
| 2. Cycle consistency: S -> R -> S should reconstruct original |
| 3. Semantic consistency: labels should be preserved after adaptation |
| 4. Feature-level alignment: intermediate features should match |
| |
+----------------------------------------------------------------------+
Mitigation Strategy 3: Style Randomization
Instead of trying to match real-world appearance exactly, randomize the visual style so aggressively that the model learns to be invariant to appearance:
For each training image:
- Randomly vary: brightness, contrast, saturation, hue
- Random color jitter on materials
- Random lighting direction and intensity
- Random post-processing effects
Theory: If the model sees enough visual variation during training,
it will learn features that are robust to any domain shift,
including the sim-to-real gap.
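A minimal sketch of style randomization using torchvision's stock augmentations; the specific transforms and parameter ranges are illustrative, not Applied Intuition's published recipe:

# Minimal sketch: aggressive appearance randomization at training time.
import torchvision.transforms as T

style_randomize = T.Compose([
    T.ColorJitter(brightness=0.5, contrast=0.5, saturation=0.5, hue=0.1),
    T.GaussianBlur(kernel_size=5, sigma=(0.1, 2.0)),
    T.RandomAdjustSharpness(sharpness_factor=2, p=0.3),
    T.RandomAutocontrast(p=0.3),
])

# Applied per synthetic training image so the model cannot latch onto
# any one rendering "style" -- encouraging domain-invariant features.
# augmented = style_randomize(synthetic_image)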
5.6 Training Best Practices
Based on Applied Intuition's published guidance and industry best practices:
Strategy 1: Mixed Training + Fine-Tuning (Recommended)
Phase 1: Pre-train on large synthetic dataset (100K-1M images)
- Cheap to generate
- Perfect labels
- High diversity
Phase 2: Fine-tune on smaller real dataset (1K-10K images)
- Adapts to real-world distribution
- Preserves features learned from synthetic data
- Much less real data needed than training from scratch
Result: Up to 90% reduction in real-data annotation requirements
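Sketched as a generic two-phase PyTorch loop (the model and data loaders are assumed given; the lower fine-tuning learning rate is the key detail):

# Minimal sketch of the pretrain-then-finetune recipe (generic PyTorch).
import torch

def train(model, loader, lr, epochs, device="cuda"):
    opt = torch.optim.AdamW(model.parameters(), lr=lr)
    model.train()
    for _ in range(epochs):
        for images, targets in loader:
            # Assumes the model returns its training loss directly
            loss = model(images.to(device), targets.to(device))
            opt.zero_grad()
            loss.backward()
            opt.step()

# Phase 1: pre-train on the large synthetic set (cheap, perfectly labeled)
train(model, synthetic_loader, lr=1e-4, epochs=10)
# Phase 2: fine-tune on the small real set at a lower learning rate,
# adapting to the real distribution without forgetting synthetic features
train(model, real_loader, lr=1e-5, epochs=5)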
Strategy 2: Minority Class Upsampling
Real-World Problem:
Cars: 100,000 examples (easy to collect)
Trucks: 20,000 examples
Motorcycles: 5,000 examples
Wheelchairs: 200 examples (critically underrepresented)
Construction: 150 examples (rare but safety-critical)
Animals: 100 examples
Solution with Synthetic Data:
Generate 50,000 synthetic examples per minority class
Mixed training: real + synthetic for all classes
Result: Detection recall on minority classes improves 20-40%
without degrading majority class performance
Strategy 3: Lower Bound Guarantee
Key Finding from Academic + Applied Intuition Research:
=======================================================
Adding synthetic data NEVER hurts performance (with proper training)
Performance(real + synthetic) >= Performance(real only)
This "lower bound guarantee" holds when:
1. Synthetic data is used for pre-training, not replacement
2. Fine-tuning on real data is always the final step
3. Synthetic distribution is at least as diverse as real
4. Domain adaptation is applied to reduce gap
This means there is NO risk to adding synthetic data --
only upside. This makes it an easy sell to engineering teams.
5.7 Case Studies
Case Study 1: 90% Data Reduction
Task: Train 3D object detector for autonomous trucking
Baseline:
- 100,000 manually annotated real frames
- 6 weeks of annotation time
- $500K annotation cost
- mAP: 72.3%
With Synthetic Data:
- 10,000 real frames (10% of original)
- 500,000 synthetic frames (auto-labeled)
- 2 days of generation time
- $15K compute cost
- mAP: 73.1% (BETTER than baseline)
Savings: $485K and 5.5 weeks per annotation cycle
Case Study 2: Minority Class Upsampling
Task: Improve pedestrian detection in rain conditions
Problem:
- Rain-condition pedestrian examples: 342 in entire dataset
- Miss rate in rain: 23% (vs. 4% in clear weather)
Solution:
- Generate 50,000 synthetic rain + pedestrian scenes
- Fine-tune detector on mixed real + synthetic data
Result:
- Rain pedestrian miss rate: 23% -> 8%
- Clear weather performance: unchanged
- No additional real-data collection needed
6. Sensor Simulation Engine
6.1 Custom Vulkan Rendering Engine
Applied Intuition built their sensor simulation engine from scratch using the Vulkan graphics API rather than licensing a game engine. This was a deliberate, multi-year investment that provides fundamental technical advantages:
+----------------------------------------------------------------------+
| WHY BUILD A CUSTOM RENDERING ENGINE? |
+----------------------------------------------------------------------+
| |
| 1. BIT-EXACT DETERMINISM |
| Game engines: floating point order varies between runs |
| Custom engine: identical output for identical input, every time |
| Why it matters: regression testing requires reproducibility |
| |
| 2. FULL PIPELINE CONTROL |
| Game engines: fixed rendering pipeline, can only hook in |
| Custom engine: every stage is customizable |
| Why it matters: sensor physics don't follow game rendering rules |
| |
| 3. MULTI-SPECTRAL RENDERING |
| Game engines: visible light only (RGB) |
| Custom engine: NIR, LWIR, multi-band in same pipeline |
| Why it matters: cameras increasingly use NIR (night vision), |
| thermal cameras are used in defense |
| |
| 4. TRUE SENSOR MODELS |
| Game engines: "looks good to humans" rendering |
| Custom engine: "matches sensor physics" rendering |
| Why it matters: ML models exploit statistical artifacts that |
| humans don't notice |
| |
+----------------------------------------------------------------------+
6.2 Ray Tracing Benefits
The Vulkan engine uses hardware-accelerated ray tracing (via the VK_KHR_ray_tracing_pipeline extension) for physically accurate light transport simulation:
Ray Tracing Enables:
====================
1. ACCURATE REFLECTIONS
+-----------+ +----------+
| Wet Road | <---- | Headlight| Ray traced reflection on wet pavement
| (mirror- | | (light | reveals vehicles hidden by road geometry
| like) | | source) |
+-----------+ +----------+
Why it matters: perception must handle reflections correctly
or it will detect "phantom" vehicles in the road surface
2. GHOST TARGETS (Radar Multi-path)
Vehicle A Wall Vehicle B (hidden)
+--------+ +---------+ +--------+
| Radar |====>| |======>| |
| Sensor | | Bounce | | Ghost |
+--------+ +---------+ | Target |
+--------+
Ray tracing models multi-path radar reflections that create
ghost targets -- critical for radar perception testing
3. ROLLING SHUTTER SIMULATION
Real cameras read pixels row-by-row, not all at once:
Row 0: t = 0.000 ms -----> captured
Row 100: t = 0.833 ms -----> captured
Row 200: t = 1.667 ms -----> captured
Row 1200: t = 10.00 ms -----> captured
Fast-moving objects appear skewed due to row timing.
Ray tracing enables per-row temporal offset simulation (see the timing sketch after this list).
4. ACCURATE SHADOWS
Soft shadows from area lights (sun, sky)
Hard shadows from point lights (streetlights)
Shadow cascading for large outdoor scenes
Shadows affect perception significantly:
- Shadows on road can be mistaken for obstacles
- Objects in shadow are harder to detect
- Shadow patterns vary with time and weather
5. GLOBAL ILLUMINATION
Light bouncing between surfaces (color bleeding):
- Red car parked next to white wall -> wall has pink tint
- Under-bridge lighting from ground bounce
- Interior of tunnel from scattered daylight
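To make the rolling-shutter timing from item 3 concrete, per-row capture times and the resulting skew of a moving object follow directly from the readout parameters (minimal sketch; the 10 ms full-frame readout matches the numbers above):

# Minimal sketch: per-row capture times and rolling-shutter skew.
import numpy as np

ROWS = 1200              # sensor rows
READOUT_S = 0.010        # full-frame readout time (10 ms, as above)

row_time = np.arange(ROWS) * (READOUT_S / ROWS)   # row 0 at t=0, row 1200 at ~10 ms

# A car crossing the image at 20 m/s has moved this far by the time
# the last row is read vs. the first -- the source of the visible skew:
lateral_shift_m = 20.0 * row_time[-1]   # ~0.2 m of skew across the frame
print(f"object shifts {lateral_shift_m:.2f} m between top and bottom rows")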
6.3 Physics-Based Sensor Models
Each sensor type has a dedicated physics-based simulation model:
Camera Model:
+----------------------------------------------------------------------+
| CAMERA SENSOR MODEL |
+----------------------------------------------------------------------+
| |
| Scene Radiance --> Lens Model --> Sensor Model --> ISP --> Output |
| |
| Lens Model: Sensor Model: |
| - Focal length - Bayer pattern |
| - Aperture (f-stop) - Quantum efficiency |
| - Distortion (Brown model) - Read noise |
| - Chromatic aberration - Dark current |
| - Vignetting - ADC quantization |
| - Flare/ghosting - Pixel crosstalk |
| - Rolling shutter |
| ISP (Image Signal Processor): |
| - Demosaicing - HDR tone mapping |
| - White balance - Gamma correction |
| - Denoising - JPEG compression |
| - Sharpening - Auto exposure |
| |
+----------------------------------------------------------------------+
LiDAR Model:
+----------------------------------------------------------------------+
| LIDAR SENSOR MODEL |
+----------------------------------------------------------------------+
| |
| Beam Pattern: Return Model: |
| - Firing sequence - Surface reflectance (Lambertian) |
| - Angular resolution - Beam divergence |
| - Vertical channels - Range-dependent SNR |
| - Rotation rate - Multi-return processing |
| - Jitter model - Near-field cutoff |
| |
| Environmental Effects: Noise Model: |
| - Rain attenuation - Range noise (Gaussian + outliers) |
| - Fog scattering - Intensity noise |
| - Snow returns - Beam dropout |
| - Sun saturation (blooming) - Crosstalk between beams |
| |
| Supported LiDAR Models: |
| Velodyne (VLP-16/32/128), Ouster (OS1/2), Hesai (Pandar/AT128), |
| Luminar (Iris), Innoviz (InnovizTwo), Robosense, Livox, custom |
| |
+----------------------------------------------------------------------+
Radar Model:
+----------------------------------------------------------------------+
| RADAR SENSOR MODEL |
+----------------------------------------------------------------------+
| |
| Electromagnetic Simulation: Signal Processing: |
| - Transmit pattern (FMCW) - Range-Doppler FFT |
| - Antenna array model - CFAR detection |
| - RCS computation - Angle estimation (DOA) |
| - Multi-path propagation - Clustering |
| - Ground bounce - Tracking |
| |
| Material Properties: Environmental: |
| - Metal: high RCS - Rain clutter |
| - Plastic: low RCS - Road spray |
| - Glass: angle-dependent - Guardrail multi-path |
| - Rubber: absorptive - Tunnel effects |
| - Asphalt: rough surface - Bridge overpass reflections |
| |
| Output Formats: |
| - Detection list (range, azimuth, elevation, velocity, RCS) |
| - Range-Doppler map (2D FFT output) |
| - Point cloud (after clustering) |
| - Raw ADC samples (for signal processing R&D) |
| |
+----------------------------------------------------------------------+
Ultrasonic Model:
+----------------------------------------------------------------------+
| ULTRASONIC SENSOR MODEL |
+----------------------------------------------------------------------+
| |
| Near-field simulation (0.2m - 5m range): |
| - Acoustic propagation modeling |
| - Temperature-dependent speed of sound |
| - Surface reflection coefficients |
| - Beam pattern simulation |
| - Cross-talk between sensors |
| |
| Use cases: parking assist, low-speed maneuvering, object proximity |
| |
+----------------------------------------------------------------------+
6.4 Three Perception Simulation Strategies
Applied Intuition supports three distinct approaches to testing perception systems, each with different fidelity/flexibility tradeoffs:
+----------------------------------------------------------------------+
| THREE STRATEGIES FOR PERCEPTION SIMULATION |
+----------------------------------------------------------------------+
Strategy 1: LOG REPLAY (Highest Fidelity, Lowest Flexibility)
=============================================================
Real Sensor Data --> Perception Stack --> Compare to Ground Truth
(recorded logs) (under test) (human annotations)
Pro: Perfect sensor fidelity (it IS real data)
Pro: No domain gap whatsoever
Con: Cannot modify scenarios (fixed to what was recorded)
Con: Requires expensive human annotations for ground truth
Con: Limited to ego's original viewpoint
Best for: Perception regression testing on known scenarios
Strategy 2: ACTOR PATCHING (Medium Fidelity, Medium Flexibility)
================================================================
Real Background + Synthetic Actors --> Perception Stack
(from log) (rendered into scene) (under test)
Pro: Real background maintains fidelity for static scene
Pro: Can add/remove/modify actors (vehicles, pedestrians)
Pro: Automatic ground truth for synthetic actors
Con: Lighting mismatch between real BG and synthetic actors
Con: Limited to original ego viewpoint for background
Best for: Testing specific interaction scenarios
Strategy 3: FULLY SYNTHETIC (Lowest Fidelity, Highest Flexibility)
==================================================================
Fully Rendered Scene --> Perception Stack --> Evaluation
(Sensor Sim engine) (under test)
Pro: Complete control over every aspect of the scene
Pro: Any scenario, any viewpoint, any conditions
Pro: Perfect ground truth for everything
Pro: Unlimited diversity
Con: Domain gap (synthetic != real)
Con: Requires high-quality 3D assets
Con: Computationally expensive at scale
Best for: Edge case testing, rare scenario coverage,
safety validation, dataset generation
7. Defense & Multi-Domain Expansion
Applied Intuition has aggressively expanded into the defense and national security market, recognizing that autonomous systems for military applications face simulation and validation challenges similar to those of commercial AV development.
7.1 Applied Intuition Defense
The defense division operates through a separate entity (Applied Intuition Defense) to handle classified programs and ITAR compliance.
7.2 Axion: Mission-Critical Toolchain
+----------------------------------------------------------------------+
| AXION |
+----------------------------------------------------------------------+
| |
| "The simulation and testing platform for defense autonomous systems" |
| |
| Capabilities: |
| - Multi-domain simulation (ground, air, maritime, space) |
| - Contested environment modeling (EW, GPS-denied, comms-degraded) |
| - Red team / blue team scenario generation |
| - Swarm behavior testing |
| - Mission planning and rehearsal |
| - Integration with DoD accreditation frameworks |
| - Classified environment deployment (IL4/IL5/IL6) |
| |
| Based on same core engine as commercial products, with: |
| - Military-specific sensor models (EO/IR, SAR, SIGINT) |
| - Threat models (adversarial actors, countermeasures) |
| - Terrain models (DTED, CDB) |
| - C2 integration (command and control) |
| |
+----------------------------------------------------------------------+
7.3 Acuity: Onboard Autonomy Software
Acquired through the EpiSci acquisition, Acuity is Applied Intuition's onboard autonomy software for defense platforms:
+----------------------------------------------------------------------+
| ACUITY |
+----------------------------------------------------------------------+
| |
| "Tactical AI for autonomous decision-making" |
| |
| Capabilities: |
| - Real-time autonomous decision-making |
| - Adversarial reasoning (predict enemy actions) |
| - Multi-agent coordination (swarm tactics) |
| - Human-on-the-loop autonomy levels |
| - Sensor fusion for military sensors |
| - Degraded communications handling |
| |
| Heritage: |
| - DARPA ACE (Air Combat Evolution) program |
| - Demonstrated autonomous air combat maneuvers |
| - Live flight testing on military aircraft |
| |
+----------------------------------------------------------------------+
7.4 Defense Programs
| Program | Customer | Description |
|---|---|---|
| Army RCV | US Army | Robotic Combat Vehicle simulation and testing |
| AiTR | US Army | Automatic Target Recognition algorithms |
| Beacon | Northrop Grumman | Autonomous systems integration and testing |
| DARPA ACE | DARPA (via EpiSci) | Autonomous air combat AI |
| Various Classified | Multiple | Cannot be publicly discussed |
7.5 UK Expansion
Applied Intuition announced UK operations to serve NATO allies and the UK Ministry of Defence, reflecting the international demand for autonomous systems validation in defense contexts. The UK expansion (along with the broader European SDS launch announced in January 2026) positions the company for significant international growth.
7.6 Congressional Testimony
In December 2025, Applied Intuition leadership provided congressional testimony on "AI, Autonomy, and America's Strategic Infrastructure," highlighting the dual-use nature of autonomous vehicle technology and its implications for national security. This reflects the company's strategic positioning as a critical infrastructure provider for both commercial and defense autonomous systems.
8. Competitive Landscape
8.1 Detailed Competitor Comparison
+======================================================================+
| COMPETITIVE LANDSCAPE MATRIX |
+======================================================================+
| Dimension | Applied Intuition (platform) | Waymo (SimAgent) | NVIDIA DRIVE Sim | CARLA (open-source) | Waabi | Foretellix (scenario testing) |
|---|---|---|---|---|---|---|
| Product Type | Platform (B2B) | Internal tool | Platform (B2B) | Open-source | Stack + sim | Testing platform |
| Sensor Sim Engine | Custom Vulkan | Internal engine | Omniverse (RTX) | CARLA UE4 | Custom engine | Partners |
| Neural Rendering | Gaussian Splatting | NeRF / 3DGS | Limited | No | NeRF | No |
| Object Sim (scenario-based) | Yes (ISO cert) | Yes (internal) | Yes (basic) | Yes (basic) | Yes (basic) | Yes (deep) |
| Synthetic Data Pipeline | Full pipeline | Internal | Full pipeline | Partial | No | No |
| LiDAR Sim | Ray-traced (physics) | Internal | Ray-traced | UE4 depth | Ray-traced | Partners |
| Radar Sim | Physics-based EM | Internal | Basic | No | Basic | No |
| Cloud Orchestration | Cloud Engine | Internal infra | DGX Cloud | Docker | Cloud | Cloud |
| V&V / Compliance | Yes (ISO) | Internal | Limited | No | No | Yes |
| Map Tools | Yes (AI) | Internal | Limited | OSM | No | Limited |
| Defense | Yes (Axion + Acuity) | Internal | NVIDIA Military | No | No | No |
| Autonomy Stack | Yes (SDS) | Waymo Driver | NVIDIA DRIVE | No | Waabi Driver | No |
| Customer Base | 18/20 top OEMs | 0 (internal) | Growing | Open community | <5 OEMs | ~10 OEMs |
| Pricing | Enterprise SaaS | N/A | Enterprise | Free | Enterprise | Enterprise SaaS |
8.2 Competitor Deep Dives
Waymo (Alphabet)
Strengths:
- Most real-world autonomous driving experience (20M+ rider-only miles)
- Deep internal simulation (billions of simulated miles)
- Published research: SimAgent, Waymax (open-source), WOSAC challenge
- Best real-world data flywheel
Weaknesses (from Applied Intuition's perspective):
- Internal tools -- not available to OEMs
- Not a platform play (Waymo builds its own car, not tools for others)
- Focused on robotaxi, not ADAS/L2++
Relationship: Not a direct competitor (different business models)
But sets the technical bar for simulation quality
NVIDIA DRIVE Sim (Omniverse)
Strengths:
- RTX GPU ray tracing hardware ecosystem
- Omniverse platform for 3D collaboration
- NVIDIA DRIVE ecosystem (compute + software)
- DGX Cloud for simulation compute
- Strong brand with OEMs for compute hardware
Weaknesses:
- Sensor physics fidelity lags Applied Intuition
- No neural rendering (Gaussian Splatting) in production
- Weaker V&V and compliance tooling
- Platform maturity: newer entrant to AV simulation specifically
- Conflict of interest: sells compute AND simulation (OEMs wary of lock-in)
Relationship: Primary competitive threat
Often competes head-to-head for OEM deals
NVIDIA has the GPU ecosystem advantage
Applied Intuition has the simulation depth advantage
CARLA (Open-Source)
Strengths:
- Free and open-source
- Large research community
- Good for academic prototyping
- Unreal Engine 4 graphics
- Extensive documentation
Weaknesses:
- Not production-grade (no ISO certification)
- Non-deterministic (UE4 rendering)
- No real sensor physics (approximated)
- No cloud orchestration at scale
- No professional support or V&V tooling
- Scaling beyond ~100 concurrent sims is painful
Relationship: Not a direct competitor for enterprise deals
But influences researcher expectations
Some Applied Intuition customers "graduate" from CARLA
Waabi (AI-first autonomous driving)
Strengths:
- Founded by Raquel Urtasun (former Uber ATG chief scientist)
- AI-first simulation approach
- Strong academic pedigree
- Focused on trucking (large market)
- Raised ~$200M+
Weaknesses:
- Much smaller scale than Applied Intuition
- Building both stack + sim (diluted focus)
- Limited OEM customer base
- Less mature platform
Relationship: Competitor in trucking autonomous driving
Different approach (integrated stack vs. platform)
Foretellix
Strengths:
- Deep scenario-based testing expertise
- V&V focused (coverage-driven verification)
- BSI certification for safety assurance
- Strong in European OEM market
Weaknesses:
- Narrow product scope (testing/V&V only, not full simulation)
- No sensor simulation engine
- No neural rendering
- No synthetic data
- Acquired by NVIDIA (2024) -- integration uncertainty
Relationship: Was a direct competitor for testing/V&V
Now part of NVIDIA -- strengthens NVIDIA's offering
Applied Intuition must match V&V depth
8.3 Applied Intuition's Competitive Advantages
+----------------------------------------------------------------------+
| APPLIED INTUITION'S COMPETITIVE MOATS |
+----------------------------------------------------------------------+
| |
| 1. CUSTOMER LOCK-IN (18/20 top OEMs) |
| - Years of workflow integration |
| - Switching cost: millions of dollars + years of migration |
| - Engineers trained on Applied Intuition tools |
| |
| 2. FULL-STACK PLATFORM |
| - No competitor matches breadth (sim + data + V&V + maps + stack)|
| - Single vendor simplifies procurement for OEMs |
| - Cross-product data flywheel |
| |
| 3. CUSTOM RENDERING ENGINE |
| - 5+ years of development investment |
| - Sensor physics accuracy unmatched by game-engine approaches |
| - Deterministic execution for CI/CD |
| |
| 4. NEURAL SIM + SYNTHETIC DATA PIPELINE |
| - Gaussian Splatting + PBR hybrid approach |
| - Automated reconstruction pipeline |
| - No competitor has this at production scale |
| |
| 5. DEFENSE EXPANSION |
| - EpiSci acquisition brings classified program access |
| - Dual-use technology creates R&D funding flywheel |
| - Defense margins typically higher than commercial |
| |
| 6. SDS PROGRAM (INTERNAL CUSTOMER) |
| - Dogfooding their own tools |
| - Identifies gaps before customers do |
| - Proves platform can support full AV development |
| |
+----------------------------------------------------------------------+
8.4 Technology Bets and Risks
| Bet | Upside | Risk |
|---|---|---|
| Gaussian Splatting | Real-time neural rendering, editability | Technology is young; quality may plateau |
| Custom Vulkan Engine | Full control, sensor accuracy | Massive maintenance burden; must keep pace with GPU arch changes |
| SDS (Autonomy Stack) | Huge TAM; validates own tools | May alienate OEM customers who view it as competing |
| Defense Expansion | High margins; government contracts | Classified work is slow; different sales cycles |
| B2B SaaS for All OEMs | Massive market coverage | Some OEMs may eventually build in-house |
| Generative AI (Copilot, Maps) | Productivity multiplier | Relies on LLM quality; liability for generated scenarios |
9. Team Structure & What to Expect
9.1 Key Engineering Teams
+======================================================================+
| ENGINEERING TEAM MAP |
+======================================================================+
| |
| SIMULATION DATA & ML |
| ========== ========= |
| Neural Sim Team Synthetic Data Team |
| - 3D reconstruction - Data generation pipelines |
| - Gaussian Splatting - Domain adaptation |
| - Neural rendering - Auto-labeling |
| - Scene manipulation - Training integration |
| |
| Sensor Sim Team Data Explorer Team |
| - Vulkan rendering engine - Fleet data management |
| - Physics-based models - Search and retrieval |
| - Ray tracing - Anomaly detection |
| - Multi-spectral - Scalable infrastructure |
| |
| Object Sim Team ML Platform Team |
| - Scenario execution - Training infrastructure |
| - Traffic models - Model deployment |
| - Physics engine - Experiment tracking |
| - Deterministic sim - GPU cluster management |
| |
| INFRASTRUCTURE PRODUCT |
| ============== ======= |
| Cloud Engine Team SDS Team |
| - Parallel orchestration - Perception stack |
| - Auto-scaling - Planning/control |
| - Job scheduling - Integration |
| - Multi-cloud - Vehicle deployment |
| |
| Platform Team Map Toolset Team |
| - Core APIs - HD map creation |
| - Developer experience - Generative AI for maps |
| - CI/CD integration - Map formats |
| - Auth/security - Real-time updates |
| |
+======================================================================+
9.2 Skills Needed Per Team
Based on job postings and technical requirements:
Neural Sim Team:
| Skill | Priority | Details |
|---|---|---|
| 3D Computer Vision | Critical | Multi-view geometry, camera models, Structure from Motion |
| Neural Rendering | Critical | Gaussian Splatting, NeRF, differentiable rendering |
| C++ | Critical | High-performance rendering code, GPU programming |
| Python | High | ML training pipelines, data processing |
| PyTorch/JAX | High | Model training and inference |
| CUDA | High | Custom GPU kernels for rendering |
| Linear Algebra | High | Transformations, projections, optimization |
| Graphics | Medium | Rasterization, ray tracing fundamentals |
Synthetic Data Team:
| Skill | Priority | Details |
|---|---|---|
| Computer Vision | Critical | Object detection, segmentation, depth estimation |
| ML/DL | Critical | Training pipelines, model evaluation, domain adaptation |
| Python | Critical | Data pipelines, training scripts, evaluation |
| C++ | High | Rendering pipeline integration |
| Domain Adaptation | High | FDA, CyCADA, style transfer, sim-to-real |
| Data Engineering | High | Large-scale data processing, formats, storage |
| Vulkan/OpenGL | Medium | Understanding of rendering pipeline |
| Statistics | Medium | Distribution matching, experiment design |
Sensor Sim Team:
| Skill | Priority | Details |
|---|---|---|
| C++ | Critical | Core rendering engine is C++ |
| Vulkan | Critical | Graphics API used for rendering engine |
| Ray Tracing | Critical | Hardware RT, BVH, intersection |
| Sensor Physics | High | Optics, EM propagation, acoustics |
| GPU Programming | High | CUDA, compute shaders |
| Computer Graphics | High | PBR, materials, lighting, cameras |
| Signal Processing | Medium | For radar/LiDAR models |
| Python | Medium | Testing, validation, tooling |
Cloud Engine Team:
| Skill | Priority | Details |
|---|---|---|
| Distributed Systems | Critical | Orchestration, scheduling, fault tolerance |
| Cloud Infrastructure | Critical | AWS/Azure/GCP, Kubernetes, Terraform |
| Go/Rust/C++ | High | Backend services |
| Python | High | Scripting, tooling, APIs |
| Database Systems | High | Metrics storage, time-series data |
| CI/CD | High | Integration pipelines, automation |
| Networking | Medium | High-bandwidth data transfer |
9.3 Work Culture
+----------------------------------------------------------------------+
| WORK CULTURE OVERVIEW |
+----------------------------------------------------------------------+
| |
| Location: Mountain View, CA (primary campus) |
| Additional offices globally |
| In-Office: 5 days per week (full in-office policy) |
| Team Size: ~1,800+ employees (as of 2025) |
| Growth: Rapid hiring across all teams |
| |
| Culture Traits: |
| - Engineering-first (technical founders, technical culture) |
| - High intensity (startup pace at unicorn scale) |
| - Cross-functional collaboration (sim + ML + infra teams interact) |
| - Customer-driven (OEM needs directly influence roadmap) |
| - White-box philosophy (transparency in how tools work) |
| - Research-to-production pipeline (ICLR papers -> product features) |
| |
| What to Expect: |
| - Fast-paced iteration cycles |
| - Direct interaction with OEM engineering teams |
| - Exposure to cutting-edge research (NeRF, 3DGS, RL, GenAI) |
| - Large-scale systems challenges (petabytes, thousands of GPUs) |
| - High ownership and autonomy within your domain |
| - Emphasis on measurable impact (metrics-driven culture) |
| |
+----------------------------------------------------------------------+
9.4 Compensation Ranges
Based on publicly available job posting data (as of 2025-2026):
| Level | Base Salary Range | Total Comp (est. with equity) |
|---|---|---|
| New Grad / Entry | $125K - $160K | $180K - $250K |
| Mid-Level (3-5 yr) | $160K - $195K | $250K - $380K |
| Senior (5-8 yr) | $185K - $222K | $350K - $500K+ |
| Staff / Principal | $210K - $260K+ | $450K - $700K+ |
Notes:
- Bay Area cost of living adjustment applies
- Equity component is significant (pre-IPO company at $15B valuation)
- Equity is typically RSUs with 4-year vesting
- Annual refresher grants are common
- Benefits include standard tech company package (health, 401k, meals)
10. Onboarding Preparation
10.1 Technical Skills to Brush Up On
Priority 1: Must Know Well
+----------------------------------------------------------------------+
| SKILL | FOCUS AREAS |
+----------------------------------------------------------------------+
| 3D Gaussian Splatting | Original paper (Kerbl et al., 2023), |
| | differentiable rendering, splatting vs. |
| | rasterization, densification strategies |
+----------------------------------------------------------------------+
| 3D Reconstruction | Structure from Motion (SfM), multi-view |
| | stereo, bundle adjustment, COLMAP |
+----------------------------------------------------------------------+
| C++ (Modern) | C++17/20, RAII, move semantics, templates, |
| | concurrency, performance optimization |
+----------------------------------------------------------------------+
| Python (Scientific) | NumPy, PyTorch, data pipelines, profiling, |
| | multiprocessing |
+----------------------------------------------------------------------+
| Linear Algebra | Transformations (SE3), projections, |
| | SVD, eigendecomposition, least squares |
+----------------------------------------------------------------------+
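As a quick self-check on the linear algebra row above, here is a minimal NumPy sketch of an SE(3) transform plus pinhole projection; the intrinsics and pose are illustrative values, not tied to any convention used internally:

```python
import numpy as np

def se3(R: np.ndarray, t: np.ndarray) -> np.ndarray:
    """Assemble a 4x4 homogeneous transform from rotation R (3x3) and translation t (3,)."""
    T = np.eye(4)
    T[:3, :3] = R
    T[:3, 3] = t
    return T

def project(points_world: np.ndarray, T_cam_from_world: np.ndarray, K: np.ndarray) -> np.ndarray:
    """Project Nx3 world points to pixel coordinates with intrinsics K (3x3)."""
    pts_h = np.hstack([points_world, np.ones((len(points_world), 1))])  # homogeneous
    pts_cam = (T_cam_from_world @ pts_h.T).T[:, :3]                     # world -> camera
    uv = (K @ (pts_cam / pts_cam[:, 2:3]).T).T                          # divide by depth, apply K
    return uv[:, :2]

# A point straight ahead of the camera lands on the principal point (960, 540)
K = np.array([[1000.0, 0.0, 960.0], [0.0, 1000.0, 540.0], [0.0, 0.0, 1.0]])
T = se3(np.eye(3), np.array([0.0, -1.5, 0.0]))   # camera 1.5 m above the world origin
print(project(np.array([[0.0, 1.5, 10.0]]), T, K))
```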
Priority 2: Should Be Familiar With
+----------------------------------------------------------------------+
| SKILL | FOCUS AREAS |
+----------------------------------------------------------------------+
| Vulkan API | Graphics pipeline, compute shaders, |
| | memory management, synchronization |
+----------------------------------------------------------------------+
| Ray Tracing | BVH construction, intersection algorithms, |
| | RTX hardware, denoising |
+----------------------------------------------------------------------+
| NeRF | Original paper, volume rendering equation, |
| | positional encoding, hierarchical sampling |
+----------------------------------------------------------------------+
| Domain Adaptation | FDA, CyCADA, adversarial training, |
| | distribution shift |
+----------------------------------------------------------------------+
| PBR Rendering | Cook-Torrance BRDF, material models, |
| | environment mapping, IBL |
+----------------------------------------------------------------------+
| Sensor Physics | Camera optics, LiDAR time-of-flight, |
| | radar FMCW, signal processing |
+----------------------------------------------------------------------+
Priority 3: Nice to Know
+----------------------------------------------------------------------+
| SKILL | FOCUS AREAS |
+----------------------------------------------------------------------+
| Autonomous Driving | Perception, prediction, planning, control |
| Pipeline | End-to-end vs. modular architectures |
+----------------------------------------------------------------------+
| Distributed Systems | Cloud orchestration, job scheduling, |
| | fault tolerance, data pipelines |
+----------------------------------------------------------------------+
| ISO 26262 / SOTIF | Functional safety concepts, V&V |
| | methodologies, ASIL levels |
+----------------------------------------------------------------------+
| Reinforcement Learning | For traffic agent behavior models |
| | (SPACeR, self-play methods) |
+----------------------------------------------------------------------+
10.2 Papers to Read Before Joining
Neural Rendering & 3D Reconstruction:
| Paper | Why It Matters |
|---|---|
| "3D Gaussian Splatting for Real-Time Radiance Field Rendering" (Kerbl et al., 2023) | Foundation of Neural Sim's static scene reconstruction |
| "NeRF: Representing Scenes as Neural Radiance Fields" (Mildenhall et al., 2020) | Predecessor technology; understand the evolution |
| "Mip-NeRF 360" (Barron et al., 2022) | Unbounded scene reconstruction (relevant for driving scenes) |
| "Dynamic 3D Gaussians" (Luiten et al., 2023) | Extending Gaussians to dynamic scenes |
| "Street Gaussians" (Yan et al., 2024) | Gaussian Splatting specifically for driving scenes |
| "S2GO: Streaming Sparse Gaussian Occupancy" (Applied Intuition, ICLR 2026) | Company's own research; occupancy from Gaussians |
Simulation & Synthetic Data:
| Paper | Why It Matters |
|---|---|
| "SHIFT: A Synthetic Driving Dataset" (Sun et al., 2022) | Benchmark for synthetic data in driving |
| "Domain Randomization for Sim-to-Real Transfer" (Tobin et al., 2017) | Foundational approach to domain gap |
| "CyCADA" (Hoffman et al., 2018) | Cycle-consistent domain adaptation for synthetic data |
| "FDA: Fourier Domain Adaptation" (Yang & Soatto, 2020) | Simple but effective domain adaptation |
| "SPACeR: Self-Play Anchoring with Centralized Reference" (Applied Intuition, 2026) | Company's traffic simulation research |
Autonomous Driving:
| Paper | Why It Matters |
|---|---|
| "UniAD: Planning-Oriented Autonomous Driving" (Hu et al., 2023) | State-of-art end-to-end AV architecture |
| "nuScenes: A Multimodal Dataset" (Caesar et al., 2020) | Standard AV dataset format and benchmarks |
| "KITTI Vision Benchmark Suite" (Geiger et al., 2012) | Classic AV benchmark; synthetic data often uses KITTI format |
| "WayMax: An Accelerated Data-Driven Simulator" (Gulino et al., 2024) | Waymo's simulator; understand competitive approaches |
10.3 Products to Understand
Before joining, make sure you can articulate the purpose and basic architecture of:
Must Understand Deeply:
[x] Neural Sim -- what it does, how it works, why it matters
[x] Synthetic Datasets -- generation pipeline, label types, domain gap
[x] Sensor Sim -- why custom Vulkan, physics-based models
Should Understand Well:
[x] Object Sim -- scenario-based testing, ISO certification
[x] Cloud Engine -- parallel execution, CI/CD integration
[x] Log Sim -- re-simulation, scenario extraction
[x] SDS -- what it is, why Applied Intuition builds its own stack
Should Be Aware Of:
[x] Data Explorer -- fleet data management
[x] Validation Toolset -- V&V, compliance
[x] Map Toolset -- generative AI for maps
[x] Copilot -- natural language scenario generation
[x] Axion / Acuity -- defense products
10.4 Questions to Ask During Interviews
Technical Questions:
1. "How do you handle the transition between Gaussian Splatting for backgrounds and PBR for dynamic actors? Is there a blending step, and how do you ensure lighting consistency?"
2. "What's the biggest remaining source of domain gap in your synthetic data pipeline, and what's the current approach to closing it?"
3. "How do you validate that Neural Sim reconstructions are 'good enough' for downstream AV stack testing? What metrics do you use?"
4. "What's the current bottleneck in the automated reconstruction pipeline -- is it compute, quality, or something else?"
Product & Strategy Questions:
5. "How does the SDS program interact with the simulation tools team? Does SDS drive the simulation roadmap, or vice versa?"
6. "Which sensor modality is hardest to simulate accurately -- camera, LiDAR, or radar? And where is the most active research happening?"
7. "How do you think about the competitive threat from NVIDIA DRIVE Sim, especially given their hardware ecosystem advantage?"
Team & Growth Questions:
8. "What does a typical project lifecycle look like on the Neural Sim team -- from research idea to production feature?"
9. "How much interaction is there between the Neural Sim and Synthetic Data teams? Are there shared components or pipelines?"
10. "What's the biggest technical challenge the team is working on right now that you're most excited about?"
11. Interview Questions
Ten technical interview questions with answer hints, focused on Applied Intuition context:
Q1: Gaussian Splatting vs. NeRF for Driving Scene Reconstruction
Question: "Why would you choose 3D Gaussian Splatting over NeRF for reconstructing driving scenes in a simulation pipeline? What are the tradeoffs?"
Answer Hints:
Key Points:
1. RENDERING SPEED: 3DGS renders at 100+ FPS via rasterization;
NeRF requires expensive ray marching (1-5 FPS).
For simulation: need real-time for closed-loop testing.
2. EXPLICIT GEOMETRY: 3DGS stores explicit 3D points (Gaussians);
NeRF is implicit (MLP weights).
For simulation: explicit geometry enables LiDAR ray casting,
actor insertion/removal, and scene editing.
3. TRAINING SPEED: 3DGS trains in minutes-hours;
NeRF trains in hours-days.
For simulation: automated pipeline needs fast turnaround.
4. QUALITY: Both achieve high quality, but 3DGS can struggle with
thin structures and reflective surfaces.
5. MEMORY: 3DGS uses more memory (millions of Gaussians);
NeRF is compact (MLP weights only).
For simulation: memory is less critical than speed.
Conclusion: For production simulation pipeline, 3DGS wins on
speed, editability, and LiDAR compatibility. NeRF may still be
useful for specific research applications.
Q2: Handling Dynamic Objects in Scene Reconstruction
Question: "How would you approach reconstructing a driving scene that contains moving vehicles and pedestrians?"
Answer Hints:
Approach: Separate static and dynamic reconstruction.
1. DETECT dynamic objects (2D/3D detection + tracking)
2. MASK dynamic objects from training data for static reconstruction
3. INPAINT masked regions in camera images
4. FILTER dynamic LiDAR points
5. RECONSTRUCT static scene using masked data (Gaussian Splatting)
6. RECONSTRUCT each dynamic actor separately:
- Multi-view reconstruction across tracked frames
- Fit parametric model (vehicle mesh) or reconstruct per-actor
- Estimate material properties (PBR)
7. COMPOSITE: Place reconstructed actors into static scene
Challenges:
- Occlusion: dynamic objects occlude each other and the background
- Limited views: only see each actor from ego's viewpoint
- Lighting consistency: actors must match scene illumination
- Temporal coherence: no flickering between frames
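Step 2 (masking) is simple enough to sketch. A minimal version, assuming tracked 2D boxes are available per frame; the dilation margin is an illustrative choice to absorb loose boxes and motion blur:

```python
import numpy as np

def static_pixel_mask(h: int, w: int, dynamic_boxes, dilate: int = 8) -> np.ndarray:
    """Boolean mask: True where the image is static (usable for reconstruction),
    False inside (dilated) tracked dynamic-object boxes.

    dynamic_boxes: iterable of (x_min, y_min, x_max, y_max) per dynamic object.
    """
    static = np.ones((h, w), dtype=bool)
    for x0, y0, x1, y1 in dynamic_boxes:
        static[max(0, y0 - dilate):min(h, y1 + dilate),
               max(0, x0 - dilate):min(w, x1 + dilate)] = False
    return static

# During static reconstruction, the photometric loss is restricted to static pixels:
#   loss = ((render - image) ** 2)[static_pixel_mask(H, W, boxes)].mean()
```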
Q3: Domain Gap in Synthetic Data
Question: "You've generated 1 million synthetic training images. A detector trained on this data achieves 80% mAP on synthetic test data but only 55% mAP on real data. How would you diagnose and fix this?"
Answer Hints:
Diagnosis:
1. VISUALIZE: t-SNE/UMAP of features from synthetic vs. real images
Look for cluster separation (indicates domain gap)
2. ERROR ANALYSIS: What types of errors dominate on real data?
- False negatives? (missing detections -> appearance gap)
- False positives? (hallucinated objects -> content gap)
- Misclassifications? (confusion between classes -> texture gap)
3. ABLATION: Which factors cause the gap?
- Test on real images with synthetic-like post-processing
- Test synthetic images with real-world noise added
Fixes (in order of effort):
1. MIXED TRAINING: Add even a small amount of real data (5-10%)
2. FDA: Fourier Domain Adaptation (swap low-freq spectrum)
3. STYLE RANDOMIZATION: Aggressive visual augmentation during training
4. CyCADA: Adversarial domain adaptation (more complex, more effective)
5. FINE-TUNING: Pre-train on synthetic, fine-tune on real
6. IMPROVE RENDERING: Better materials, lighting, sensor models
Expected result: mixed training + FDA alone typically closes
50-70% of the gap. Fine-tuning closes most of the remainder.
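Of the fixes above, FDA is compact enough to sketch in full: swap the low-frequency amplitude spectrum of a synthetic image with that of a real one, keeping the synthetic phase. A minimal NumPy version (per-image, same-size float inputs assumed; beta follows the small window sizes from the paper):

```python
import numpy as np

def fda(synthetic: np.ndarray, real: np.ndarray, beta: float = 0.05) -> np.ndarray:
    """Fourier Domain Adaptation: give a synthetic image the low-frequency
    "style" of a real image. Inputs are (H, W, C) floats in [0, 1]."""
    # Per-channel FFT, shifted so low frequencies sit at the array center
    f_syn = np.fft.fftshift(np.fft.fft2(synthetic, axes=(0, 1)), axes=(0, 1))
    f_real = np.fft.fftshift(np.fft.fft2(real, axes=(0, 1)), axes=(0, 1))

    amp_syn, phase_syn = np.abs(f_syn), np.angle(f_syn)
    amp_real = np.abs(f_real)

    # Swap the central low-frequency amplitude block
    h, w = synthetic.shape[:2]
    b = int(min(h, w) * beta)
    cy, cx = h // 2, w // 2
    amp_syn[cy - b:cy + b, cx - b:cx + b] = amp_real[cy - b:cy + b, cx - b:cx + b]

    # Recombine with the synthetic phase and invert
    mixed = amp_syn * np.exp(1j * phase_syn)
    out = np.fft.ifft2(np.fft.ifftshift(mixed, axes=(0, 1)), axes=(0, 1))
    return np.clip(np.real(out), 0.0, 1.0)
```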
Q4: Sensor Simulation Fidelity
Question: "How would you validate that a simulated LiDAR sensor produces outputs that are statistically equivalent to the real sensor?"
Answer Hints:
Validation Framework:
Level 1: POINT-LEVEL METRICS
- Range accuracy (mean/std of range error vs. ground truth)
- Intensity distribution (histogram comparison)
- Angular accuracy (azimuth/elevation error)
- Noise characteristics (noise floor, range-dependent noise)
Level 2: OBJECT-LEVEL METRICS
- Point density on objects (does a car get the right # of points?)
- Occlusion accuracy (correct point dropouts behind objects)
- Multi-return behavior (ground + vegetation canopy)
- Near-field cutoff accuracy
Level 3: SCENE-LEVEL METRICS
- Overall point density distribution
- Ground plane fit accuracy
- Dynamic range (min/max intensity, range)
Level 4: DOWNSTREAM METRICS (most important)
- Run the same detector on sim LiDAR vs. real LiDAR
- Compare detection mAP, recall, precision
- Statistical hypothesis test: is the difference significant?
- If detector performance is statistically equivalent,
the simulation is "good enough" for that application
Method: Collect paired data (drive real car with real sensor,
reconstruct same scene in simulation, compare outputs).
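A minimal sketch of one Level 1 check, assuming paired samples of a per-point channel (range error, intensity) have already been extracted from real and simulated scans; the significance level is an illustrative choice:

```python
import numpy as np
from scipy import stats

def compare_channel(real: np.ndarray, sim: np.ndarray, alpha: float = 0.01) -> dict:
    """Two-sample KS test: could the real and simulated samples plausibly
    come from the same distribution?"""
    ks_stat, p_value = stats.ks_2samp(real, sim)
    return {
        "ks_statistic": float(ks_stat),
        "p_value": float(p_value),
        "mean_gap": float(np.mean(sim) - np.mean(real)),
        "std_ratio": float(np.std(sim) / np.std(real)),
        "equivalent": bool(p_value > alpha),   # fail to reject H0 at level alpha
    }
```

Note the caveat from Level 4: passing point-level tests is necessary but not sufficient -- the downstream detector comparison is the metric that actually matters.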
Q5: Scaling Simulation in the Cloud
Question: "Design a system to run 10,000 simulation scenarios per night, each requiring a GPU for sensor rendering, with results available by 8 AM."
Answer Hints:
Architecture:
1. JOB SCHEDULER (8 PM start)
- Priority queue: safety-critical first, regression second
- Dependency graph: some scenarios depend on others
- Estimated runtime per scenario: ~5-15 minutes (GPU-rendered)
2. RESOURCE ALLOCATION
- 10,000 scenarios x 10 min avg = 100,000 GPU-minutes
- 12 hours available (8 PM - 8 AM) = 720 GPU-minutes per GPU
- Need: ~140 GPUs minimum (100,000 / 720)
- Reserve: 200 GPUs (for retries, variance, overhead)
- Use spot/preemptible instances (~70% cost savings)
- Fallback to on-demand if spot gets reclaimed
3. EXECUTION
- Containerized sim (Docker + NVIDIA Container Toolkit)
- Kubernetes for orchestration (or custom scheduler)
- Pull scenario definition -> render -> evaluate -> upload results
- Checkpoint and retry on failure (idempotent execution)
4. RESULT AGGREGATION
- Stream results as they complete (don't wait for all)
- Aggregate metrics into dashboard
- Compute regression statistics vs. baseline
- Flag regressions with severity levels
- Email/Slack notification if critical regressions found
5. COST OPTIMIZATION
- Spot instances: ~$0.30/hr vs $1.00/hr for GPU
- Batch similar scenarios on same GPU (amortize asset loading)
- Cache common assets (maps, actor models)
- Progressive rendering (quick low-fi pass first, then high-fi)
Total cost estimate: 200 GPUs x 12 hr x $0.30 = ~$720/night
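The sizing arithmetic above, written out; the per-scenario runtime, headroom factor, and spot rate are the stated assumptions, not measured figures:

```python
import math

scenarios = 10_000
avg_minutes = 10            # assumed mean GPU time per scenario
window_hours = 12           # 8 PM -> 8 AM
headroom = 1.4              # retries, runtime variance, asset-loading overhead
spot_rate = 0.30            # assumed $/GPU-hour on spot instances

gpu_minutes = scenarios * avg_minutes           # 100,000 GPU-minutes of work
per_gpu = window_hours * 60                     # 720 usable minutes per GPU
min_gpus = math.ceil(gpu_minutes / per_gpu)     # 139 -> "~140 GPUs minimum"
reserved = math.ceil(min_gpus * headroom)       # ~195 -> reserve ~200
cost = reserved * window_hours * spot_rate
print(f"{min_gpus} GPUs minimum, reserve {reserved}, ~${cost:.0f}/night")
```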
Q6: Neural Sim Scene Editing
Question: "Given a Gaussian Splatting reconstruction of a street scene, how would you insert a new vehicle that wasn't in the original recording?"
Answer Hints:
Pipeline:
1. SELECT INSERTION POINT
- Choose 3D position and orientation on the road surface
- Verify the position is physically valid (on road, not overlapping)
- Determine ground plane at insertion point
2. PREPARE ACTOR ASSET
- PBR mesh (not Gaussians) for the new vehicle
- Why PBR? Controllable, relightable, provides ground truth
- Apply material properties matching scene lighting
3. LIGHTING ESTIMATION
- Estimate scene illumination from the Gaussian reconstruction
- HDR environment map from the scene
- Sun direction and intensity from shadow analysis
4. COMPOSITING
- Render PBR vehicle with estimated lighting
- Alpha-composite over Gaussian splat rendering
- Handle occlusion: vehicle may be partially behind Gaussians
- Cast shadow: ray trace from sun through vehicle onto ground Gaussians
5. MULTI-SENSOR CONSISTENCY
- Camera: composite rendered vehicle image
- LiDAR: add vehicle mesh to ray casting scene
- Radar: compute RCS for vehicle geometry and material
6. VALIDATION
- Visual inspection: does the insertion look natural?
- Shadow consistency: does the shadow match the scene lighting?
- Scale consistency: is the vehicle the right size?
- Downstream test: does the perception stack detect it correctly?
Challenges:
- Reflection handling (vehicle on wet road, mirrors)
- Ground contact (vehicle tires touching road surface)
- Lighting consistency (matching exposure, white balance)
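The occlusion half of step 4 reduces to a per-pixel z-test, assuming both renderers emit depth alongside color. A minimal sketch:

```python
import numpy as np

def depth_composite(bg_rgb, bg_depth, actor_rgb, actor_depth, actor_alpha):
    """Merge the Gaussian-splat background render with the PBR actor render,
    letting reconstructed scene geometry occlude the inserted vehicle.

    bg_rgb, actor_rgb: (H, W, 3); bg_depth, actor_depth, actor_alpha: (H, W).
    """
    # The actor wins a pixel only where it is drawn AND closer than the scene
    actor_visible = (actor_alpha > 0.0) & (actor_depth < bg_depth)
    a = np.where(actor_visible, actor_alpha, 0.0)[..., None]
    rgb = a * actor_rgb + (1.0 - a) * bg_rgb
    depth = np.where(actor_visible, actor_depth, bg_depth)
    return rgb, depth
```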
Q7: Sim-to-Real Transfer for Object Detection
Question: "Describe an experiment to measure how much synthetic data helps improve a 3D object detector, and what controls you'd need."
Answer Hints:
Experimental Design:
Independent Variable: Amount of synthetic data
Dependent Variable: 3D mAP on held-out REAL test set
Conditions:
A. Real-only baseline: Train on N real samples
B. Synthetic-only: Train on M synthetic samples
C. Mixed (50/50): Train on N/2 real + M/2 synthetic
D. Pre-train + fine-tune: Pre-train on M synthetic, fine-tune on N real
E. Progressive synthetic: Vary M = {10K, 50K, 100K, 500K, 1M}
with fixed N real samples
F. Progressive real: Fix M = 500K synthetic,
vary N = {100, 500, 1K, 5K, 10K, 50K}
Controls:
- Same model architecture for all conditions
- Same hyperparameters (lr, batch size, augmentation)
- Same real test set (never seen during training)
- Multiple random seeds (3-5 runs per condition)
- Same training compute budget (early stopping on val loss)
Metrics:
- 3D mAP @ IoU 0.5 and 0.7
- Per-class AP (especially rare classes)
- Performance at different ranges (0-30m, 30-50m, 50-80m)
- Performance in different conditions (day/night, weather)
Expected Results:
- D > A > C > B (pre-train+finetune best, synthetic-only worst)
- E shows diminishing returns after ~200K synthetic
- F shows synthetic data helps most when real data is scarce
- Rare class improvement should be dramatic
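A sketch of how the per-condition results might be scored -- a bootstrap interval over seeds rather than a definitive methodology, with made-up illustrative mAP values:

```python
import numpy as np

def seed_comparison(map_a, map_b, n_boot: int = 10_000, seed: int = 0):
    """Mean mAP difference between two conditions plus a 95% bootstrap CI,
    resampling over per-seed results."""
    rng = np.random.default_rng(seed)
    a, b = np.asarray(map_a), np.asarray(map_b)
    diffs = [rng.choice(b, size=len(b)).mean() - rng.choice(a, size=len(a)).mean()
             for _ in range(n_boot)]
    lo, hi = np.percentile(diffs, [2.5, 97.5])
    return float(b.mean() - a.mean()), (float(lo), float(hi))

# Illustrative numbers only: condition A (real-only) vs. D (pretrain + fine-tune)
print(seed_comparison([61.2, 60.8, 61.5, 60.9, 61.1],
                      [64.0, 63.6, 64.3, 63.9, 64.1]))
```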
Q8: Ray Tracing for Radar Simulation
Question: "How would you use ray tracing to simulate an automotive radar sensor?"
Answer Hints:
Radar Ray Tracing Pipeline:
1. TRANSMIT MODEL
- FMCW chirp signal definition (bandwidth, chirp rate)
- Antenna pattern (transmit beam width, sidelobes)
- Multiple TX antennas for MIMO radar
2. RAY CASTING
- Cast rays from TX antenna into scene
- Number of rays: 10K-100K per frame
- Distribution: importance-sampled based on antenna pattern
3. INTERSECTION & REFLECTION
- Find first intersection with scene geometry
- Compute surface normal at intersection
- Determine material properties:
* Metal: high reflectivity, specular
* Plastic: low reflectivity, diffuse
* Glass: angle-dependent (Fresnel)
- Multi-path: trace secondary rays (up to 2-3 bounces)
* Ground bounce under vehicles
* Wall reflections in urban canyons
* Guardrail multi-path
4. RCS COMPUTATION
- Radar Cross Section from geometry + material + angle
- Accounts for surface roughness, curvature
- Frequency-dependent (77 GHz for automotive)
5. RECEIVE MODEL
- Round-trip time = 2 * range / c; range recovered from beat frequency
- Doppler = 2 * v_radial / lambda
- Angle from array processing (phase differences)
- Add noise (thermal, phase noise, quantization)
6. SIGNAL PROCESSING
- Range-Doppler FFT (2D FFT of beat signal)
- CFAR detection (threshold above noise floor)
- Angle estimation (digital beamforming / MUSIC)
- Clustering and object-level output
Key Challenge: computational cost
Solution: hybrid approach -- ray trace dominant paths,
use statistical model for diffuse scattering
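Steps 5-6 compress nicely into code: simulate the ideal beat signal for one point target, then recover it in the range-Doppler map with a 2D FFT. The FMCW parameters are illustrative, not a specific sensor spec, and noise, RCS, and antenna effects are omitted:

```python
import numpy as np

c, fc = 3e8, 77e9                       # speed of light, carrier frequency
B, T_chirp = 300e6, 50e-6               # sweep bandwidth, chirp duration
n_samples, n_chirps = 256, 128
slope = B / T_chirp                     # chirp rate (Hz/s)

def beat_signal(rng_m: float, vel_mps: float) -> np.ndarray:
    """Ideal beat signal for one point target at range rng_m with radial
    velocity vel_mps. Returns (n_chirps, n_samples) complex samples."""
    t_fast = np.arange(n_samples) / n_samples * T_chirp
    f_beat = 2 * rng_m * slope / c      # range maps to beat frequency
    f_dopp = 2 * vel_mps * fc / c       # radial velocity maps to Doppler shift
    chirp_idx = np.arange(n_chirps)[:, None]
    phase = 2 * np.pi * (f_beat * t_fast[None, :] + f_dopp * chirp_idx * T_chirp)
    return np.exp(1j * phase)

# Range FFT over fast time (axis 1), Doppler FFT over slow time (axis 0)
rd_map = np.fft.fftshift(np.fft.fft2(beat_signal(40.0, -5.0)), axes=0)
dop_bin, rng_bin = np.unravel_index(np.argmax(np.abs(rd_map)), rd_map.shape)
print(f"peak at range bin {rng_bin}, Doppler bin {dop_bin}")
```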
Q9: Gaussian Splatting Quality Issues
Question: "Your Gaussian Splatting reconstruction of a highway scene shows artifacts in the sky region and floaters near lamp posts. How would you fix these?"
Answer Hints:
Diagnosis & Fixes:
SKY ARTIFACTS:
- Cause: Gaussians try to fit the sky but sky is at "infinity"
- Gaussians placed far away become huge, cause artifacts
Fixes:
a) Separate sky model (environment map or learned sky network)
b) Prune Gaussians beyond max distance threshold
c) Use sky segmentation mask to exclude sky from Gaussian loss
d) Regularize Gaussian size (prevent very large Gaussians)
FLOATERS NEAR LAMP POSTS:
- Cause: Thin structures are hard to reconstruct from sparse views
- Gaussians "bloom" into space around thin objects
- Insufficient view coverage from ego vehicle's trajectory
Fixes:
a) Depth regularization: penalize Gaussians that don't align
with LiDAR depth measurements
b) Opacity regularization: encourage Gaussians to be fully
opaque or fully transparent (avoid semi-transparent floaters)
c) Scale regularization: limit maximum Gaussian size
d) Multi-scale training: train at multiple resolutions
e) Per-pixel depth supervision from LiDAR projection
f) Visibility-based pruning: remove Gaussians not visible from
enough training views
General Quality Improvements:
- Use LiDAR points for initialization (much better than random)
- Exposure compensation across cameras (auto-exposure varies)
- Lens distortion correction before training
- Anti-aliased splatting (Mip-Splatting approach)
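Fix (a) is compact enough to sketch: a depth term added to the 3DGS photometric loss, assuming LiDAR returns have already been projected into the training view. PyTorch-style, not the production implementation:

```python
import torch

def lidar_depth_loss(rendered_depth: torch.Tensor,
                     lidar_depth: torch.Tensor,
                     lidar_mask: torch.Tensor,
                     weight: float = 0.1) -> torch.Tensor:
    """L1 penalty tying rendered Gaussian depth to projected LiDAR depth.

    lidar_mask is True only at pixels where a LiDAR return projects; the
    returned term is added to the photometric loss each optimization step,
    pulling floaters back onto measured surfaces.
    """
    err = torch.abs(rendered_depth - lidar_depth)
    return weight * err[lidar_mask].mean()
```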
Q10: System Design -- Automated Neural Sim Pipeline
Question: "Design an automated pipeline that takes raw drive logs as input and produces simulation-ready reconstructed scenes. What are the components, failure modes, and quality gates?"
Answer Hints:
Pipeline Architecture:
INPUT:
  Drive log: camera images, LiDAR scans, GPS/IMU, metadata
PROCESSING:
  Stage 1: PREPROCESSING
  - Pose estimation/refinement
  - Timestamp synchronization
  - Calibration verification
  Stage 2: SCENE UNDERSTANDING
  - Object detection & tracking
  - Semantic segmentation
  - Dynamic object masking
  - Sky/ground classification
  Stage 3: STATIC RECONSTRUCTION
  - LiDAR-initialized Gaussians
  - Multi-view optimization
  - Sky model fitting
  - Regularization
  Stage 4: DYNAMIC RECONSTRUCTION
  - Per-actor mesh reconstruction
  - Material estimation (PBR)
  - Trajectory fitting
  Stage 5: SCENE ASSEMBLY
  - Place actors in static scene
  - Lighting consistency check
  - Multi-sensor rendering test
  Stage 6: QUALITY ASSURANCE
  - Upstream metrics (PSNR, SSIM)
  - Downstream metrics (mAP delta)
  - Automatic pass/fail
OUTPUT:
  Reconstructed scene: Gaussian cloud, actor assets, scene graph, metadata
FAILURE MODES:
1. Bad poses --> reconstruction is blurry or misaligned
Gate: pose estimation confidence score > threshold
2. Insufficient views --> holes in reconstruction
Gate: minimum coverage area check
3. Dynamic masking errors --> ghosting artifacts
Gate: temporal consistency check on masks
4. LiDAR-camera misalignment --> depth inconsistency
Gate: reprojection error < threshold
5. Over/under-exposure --> poor texture quality
Gate: histogram analysis of training images
QUALITY GATES (automatic):
- Gate 1 (after preprocessing): pose quality + calibration check
- Gate 2 (after reconstruction): PSNR > 28 dB, SSIM > 0.90
- Gate 3 (after assembly): visual artifact detector (learned)
- Gate 4 (after rendering): downstream perception metric delta < 5%
MONITORING:
- Pipeline throughput: scenes per hour
- Failure rate by stage
- Quality distribution over time
- Compute cost per scene
SLA: Drive log uploaded -> simulation-ready scene in < 8 hours
Success rate: > 90% of logs produce passing quality
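A minimal sketch of Gate 2, assuming held-out (real frame, re-render) pairs at matched poses; the 28 dB floor mirrors the gate above, and the SSIM check is left out for brevity:

```python
import numpy as np

def psnr(ref: np.ndarray, test: np.ndarray, peak: float = 1.0) -> float:
    """Peak signal-to-noise ratio in dB between a held-out real frame and
    the reconstruction re-rendered at the same pose."""
    mse = np.mean((ref.astype(np.float64) - test.astype(np.float64)) ** 2)
    return float(10.0 * np.log10(peak ** 2 / mse))

def gate_2(holdout_pairs, floor_db: float = 28.0) -> bool:
    """Pass only if every held-out view re-renders above the PSNR floor."""
    return all(psnr(ref, render) > floor_db for ref, render in holdout_pairs)
```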
12. References
Applied Intuition Official Resources
| Resource | URL |
|---|---|
| Company Homepage | https://www.appliedintuition.com |
| Blog | https://www.appliedintuition.com/blog |
| Careers | https://www.appliedintuition.com/careers |
| Defense Division | https://www.appliedintuitiondefense.com |
Key Blog Posts
| Title | Date | Topic |
|---|---|---|
| "S2GO: Streaming Sparse Gaussian Occupancy" | Feb 2026 | ICLR 2026 paper on 3D occupancy |
| "SPACeR: Self-Play Anchoring with Centralized Reference Models" | Feb 2026 | RL for traffic simulation |
| "Building for Scale: Cloud-Native HD Maps" | Jan 2026 | Scalable mapping architecture |
| "SDS Launches in Europe" | Jan 2026 | European autonomy stack offering |
| "2025 In Review: Building Physical AI" | Dec 2025 | Annual retrospective |
| "Why Deterministic Execution in ADAS Middleware Matters" | Dec 2025 | Action Graph, determinism |
| "AI, Autonomy, and America's Strategic Infrastructure" | Dec 2025 | Congressional testimony |
| "Why Powertrain Diversity Is the Real Test of SDV Platforms" | Feb 2026 | Vehicle OS for diverse powertrains |
| "The Future of Industrial Intelligence" | Feb 2026 | Physical AI for industry |
| "Applied Intuition Cabin Intelligence" | Feb 2026 | Operator interface for heavy equipment |
Foundational Papers
| Paper | Authors | Year | Relevance |
|---|---|---|---|
| "3D Gaussian Splatting for Real-Time Radiance Field Rendering" | Kerbl et al. | 2023 | Core technology for Neural Sim |
| "NeRF: Representing Scenes as Neural Radiance Fields" | Mildenhall et al. | 2020 | Predecessor to Gaussian Splatting |
| "Dynamic 3D Gaussians: Tracking by Persistent Dynamic View Synthesis" | Luiten et al. | 2023 | Dynamic scene handling |
| "Street Gaussians for Modeling Dynamic Urban Scenes" | Yan et al. | 2024 | Driving-specific Gaussians |
| "Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields" | Barron et al. | 2022 | Unbounded outdoor scenes |
| "CyCADA: Cycle-Consistent Adversarial Domain Adaptation" | Hoffman et al. | 2018 | Domain adaptation for synthetic data |
| "FDA: Fourier Domain Adaptation for Semantic Segmentation" | Yang & Soatto | 2020 | Simple domain adaptation |
| "Domain Randomization for Transferring Deep Neural Networks" | Tobin et al. | 2017 | Sim-to-real via randomization |
| "WayMax: An Accelerated, Data-Driven Simulator for Large-Scale AV Research" | Gulino et al. | 2024 | Competitor simulation approach |
| "UniAD: Planning-Oriented Autonomous Driving" | Hu et al. | 2023 | End-to-end AV architecture reference |
Industry & Market Analysis
| Source | Topic |
|---|---|
| RAND Corporation: "Driving to Safety" | Framework for statistical AV safety validation (11 billion miles argument) |
| ISO 26262 (2018 edition) | Functional safety standard for automotive |
| ISO 21448 (SOTIF) | Safety of the Intended Functionality |
| Euro NCAP 2026 Roadmap | Evolving requirements driving simulation adoption |
| UL 4600 | Safety standard for autonomous products |
Related Open-Source Projects
| Project | Description | Relevance |
|---|---|---|
| CARLA Simulator | Open-source AV simulator (UE4) | Baseline comparison |
| Waymax | Waymo's open-source simulator (JAX) | Competitor approach |
| nuScenes devkit | Dataset tools and evaluation | Standard data format |
| Open3D | 3D data processing library | Point cloud tooling |
| gsplat | Gaussian Splatting library | Reference implementation |
| nerfstudio | NeRF framework | Neural rendering research |
| COLMAP | Structure from Motion | Pose estimation reference |
Appendix A: Glossary
| Term | Definition |
|---|---|
| 3DGS | 3D Gaussian Splatting -- representing scenes as collections of 3D Gaussian ellipsoids |
| ADAS | Advanced Driver Assistance Systems (L1-L2 autonomy) |
| AV | Autonomous Vehicle |
| BVH | Bounding Volume Hierarchy -- acceleration structure for ray tracing |
| CFAR | Constant False Alarm Rate -- radar detection algorithm |
| Domain Gap | Statistical difference between synthetic and real data distributions |
| FDA | Fourier Domain Adaptation -- style transfer via frequency domain |
| FMCW | Frequency-Modulated Continuous Wave -- automotive radar signal type |
| FPS | Frames Per Second |
| GS | Gaussian Splatting |
| HDR | High Dynamic Range |
| ISP | Image Signal Processor -- converts raw sensor data to usable images |
| L2/L2++/L3/L4 | SAE autonomy levels (2=partial, 3=conditional, 4=high automation) |
| LPIPS | Learned Perceptual Image Patch Similarity |
| LWIR | Long-Wave Infrared (thermal imaging, 8-14 um) |
| mAP | Mean Average Precision -- standard detection metric |
| NeRF | Neural Radiance Field -- implicit 3D scene representation |
| NIR | Near-Infrared (0.7-1.4 um wavelength) |
| OEM | Original Equipment Manufacturer (e.g., Toyota, VW) |
| PBR | Physically Based Rendering -- material model matching real physics |
| PSNR | Peak Signal-to-Noise Ratio -- image quality metric |
| RCS | Radar Cross Section -- measure of radar reflectivity |
| SDS | Self-Driving System (Applied Intuition's autonomy stack) |
| SfM | Structure from Motion -- 3D reconstruction from images |
| SOTIF | Safety of the Intended Functionality (ISO 21448) |
| SSIM | Structural Similarity Index Measure |
| TCL | Tool Confidence Level (ISO 26262) |
| V&V | Verification and Validation |
Appendix B: Key Metrics Cheat Sheet
+======================================================================+
| METRICS YOU SHOULD KNOW |
+======================================================================+
RENDERING QUALITY:
PSNR: Peak Signal-to-Noise Ratio Higher = better (good: >30 dB)
SSIM: Structural Similarity Higher = better (good: >0.92)
LPIPS: Perceptual Similarity Lower = better (good: <0.10)
FID: Frechet Inception Distance Lower = better (good: <25)
DETECTION:
mAP: Mean Average Precision Higher = better (good: >70%)
AP@50: AP at IoU threshold 0.5 Higher = better
AP@75: AP at IoU threshold 0.75 Higher = better (stricter)
Recall: Detection rate Higher = better (good: >90%)
TRACKING:
MOTA: Multi-Object Tracking Accuracy Higher = better (good: >80%)
MOTP: Multi-Object Tracking Precision Higher = better
IDF1: ID F1 Score Higher = better
POINT CLOUD:
CD: Chamfer Distance Lower = better (good: <0.05m)
EMD: Earth Mover's Distance Lower = better
F1@d: F1 score at distance threshold Higher = better
DOMAIN GAP:
FID: Frechet Inception Distance Lower = better (between domains)
A-distance: Proxy A-distance Lower = better (feature space)
mAP delta: Real vs. Sim performance Lower = better (good: <3%)
+======================================================================+
This deep dive was prepared as an onboarding reference for engineers joining Applied Intuition's Neural Sim and Synthetic Data teams. Information is current as of March 2026 and is based on publicly available sources including Applied Intuition's website, blog posts, job postings, press releases, published research papers, and industry analysis. Some technical details are inferred from public information and general industry knowledge.