Unified Manufacturing Data Architecture
A plain language white paper for leaders, engineers, and practitioners who want a clear path from scattered data to scalable AI.
Manufacturing is on the cusp of something extraordinary.
After years—decades, even—of wrestling with disconnected systems, patchwork integrations, and siloed data, the industry is finally asking deeper, more strategic questions. How do we stop drowning in data and start using it? How do we design architectures not just for real time visibility, but for intelligence? How do we build infrastructure that doesn’t just support people, but empowers AI?
Much of this shift in thinking can be credited to a movement—not a product or a platform, but a mindset—embodied by the idea of the Unified Namespace. It’s sparked curiosity. It’s given engineers and architects a vocabulary to talk about data flow and context. And maybe most importantly, it’s brought people together. For the first time in a long time, there’s community in manufacturing data—people willing to share what works, admit what doesn’t, and help each other move forward.
But here’s the truth: asking the right questions is only half the battle.
As exciting as it is to publish data, build dashboards, and wire up event-driven systems, many teams are now discovering that they’re still not ready for what comes next. The age of agentic AI is here. Autonomous systems are learning to reason, coordinate, and act. And most factories—despite the progress—aren’t prepared.
It’s like designing a state-of-the-art smart home, complete with connected appliances, a smart shower, and motion sensing lights, only to realize no one planned the plumbing or ensured the wiring was standardized. The potential is obvious. The latest technology is available. But the foundation isn’t there.
That’s what this paper is about.
Ryan Hill has written something rare—a pragmatic, scalable, and deeply considered framework for how manufacturing organizations can actually prepare for the future. It’s called the Unified Manufacturing Data Architecture, or UMDA. And whether you’re starting from scratch, extending an existing UNS, or migrating from legacy systems, this framework stands on its own.
Ryan has done something few others have: he’s named the blind spots before they become disasters. He’s documented patterns that most don’t notice until it’s too late. And he’s offered a path forward that’s both visionary and grounded—technical where it needs to be, but never detached from real-world constraints.
This paper is generous. It’s insightful. And it’s honest.
It’s not just about architecting systems. It’s about protecting investments and extending the shelf life of existing solutions. Accelerating outcomes. Enabling intelligence. And most of all, avoiding the painful moment too many manufacturers know too well—when millions have been spent, and yet, the value still feels just out of reach.
So if you’re building. If you’re fixing. If you’re dreaming.
Jeff Knepper, President, Flow Software
Read this.
Then build better.
Introduction
Manufacturing is at a turning point. After years of connecting machines and shipping dashboards, the question has changed. It is no longer how to collect more data. It is how to use it. The goal is intelligence you can act on, not more charts to look at.
The Unified Namespace sparked a useful shift. It gave teams a way to talk about real time data flow and context. It also showed the limits of publishing without a broader design. Agentic AI is now practical. Systems can reason, coordinate, and act. Most factories are not ready because the foundation is still uneven.
This paper offers that foundation. It describes the Unified Manufacturing Data Architecture, or UMDA. It is a practical, scalable framework that respects shop floor reality and applies proven IT patterns. It connects what you already own. It preserves context. It makes AI reliable and repeatable across sites.
What UMDA is and what it changes
What it is. A vendor agnostic, layered architecture built for manufacturing. It includes domain Common Data Models, a Unified Namespace, a Unified Data Layer, local Edge Intelligence Hubs, and a Feedback Data Layer. It is shop floor first and IT friendly.
What it changes. Data keeps its context as it moves. Timestamps align across systems. People and AI consume the same source of truth. Models route to the right place and scale from one line to many sites without rewrites.
Why it works. It starts where the work happens. It blends OT constraints with IT discipline. No rip and replace. You can start small, prove value, and expand by template.
1. The current reality: disconnected data, unmet potential
Plants capture vibration, temperature, torque, quality checks, work orders, and schedule changes. Yet most teams still fight fires. The gap is not data volume. It is the lack of connected, contextual, and trustworthy information that drives coordinated action.
The fragmented landscape
SCADA, CMMS, LIMS, MES, and ERP were deployed in isolation. Each system is optimized for its own domain. They do not share a common vocabulary or time base. A maintenance insight rarely influences production scheduling in time to matter. A quality signal often lands after the fact. Everyone has a piece of the puzzle. Few can see the whole picture soon enough to act.
A unique industrial challenge
Factories blend modern IoT with legacy PLCs and proprietary protocols. Some assets stream millisecond data. Others live on clipboards. The data represents physical processes, people, and materials. Interpreting it needs domain knowledge that sits with experienced operators and engineers. Generic IT tools alone do not solve that gap.
The human cost of data chaos
Skilled people spend hours reconciling tags, units, and timestamps to answer basic questions. Action is late. Root cause work becomes guesswork. AI pilots stall after a Proof of Concept project because the inputs are inconsistent and poorly labeled. Confidence drops and teams fall back to spreadsheets and tribal knowledge.
The cost of disconnection
- A vibration alert in a stand alone tool never reaches the line lead. The pump fails. Downtime and rush repairs follow.
- A blend ratio drift is visible in batching data. Quality does not see it in time. Finished goods ship with defects.
- Maintenance follows time based schedules without condition data. Healthy assets get over serviced. At risk assets are missed.
- Schedules are based on assumed capacity because live performance is trapped in HMIs or isolated historians.
Why traditional data solutions fall short
Data lakes and enterprise integration tools were built for clean, uniform data. OT data is noisy, inconsistent, and context poor. Without a domain model and clear contracts, you get a collection of files, not a system that makes sense to people or AI. One off scripts fill the gap until they collapse under scale.
Summary. A weak foundation cannot carry AI. Fix context, time alignment, and ownership first. Then let AI amplify it.
2. The vision: a Unified Manufacturing Data Architecture
UMDA is a plain framework for making plant data useful. It is vendor agnostic and designed to scale. It does not ask you to replace what you own. It connects and elevates it. The result is a system that is understandable to humans and actionable for AI.
A framework purpose built for manufacturing
UMDA assumes messy reality. Old and new equipment. Different goals by team. Gaps in labels and units. It uses a layered design to capture, organize, and carry context from the edge to the enterprise. It makes data consistent enough for decisions and automation without breaking local control.
From data points to operational intelligence
AI fails when it sees numbers without meaning. UMDA adds meaning. A tag becomes Reactor A during Batch 42 with Operator Emily at HeatUp phase. A vibration tag becomes Line 5 pump near threshold after recent maintenance. With that context, models move from pattern spotting to sound recommendations.
Reimagining the factory floor
- Predictive maintenance plans work with schedules, energy prices, labor, and warranty terms to pick the right window.
- Dynamic quality adjusts parameters in real time, retrains visual models as defects evolve, and alerts suppliers when needed.
- Self optimizing lines tune routing, batch order, and crew assignments based on business constraints, not guesses.
- Supply chain signals blend with plant throughput and forecast data to suggest alternatives before disruption hits.
Summary. UMDA treats data as an asset and context as non negotiable. It turns scattered signals into decisions you can trust.
3. Pillars of UMDA: building the foundation
From chaos to clarity: the Common Data Model
The Common Data Model, or CDM, is the shared language. It defines assets, processes, materials, people, events, and their relationships. It sets required fields such as timestamp, equipment id, batch id, operator id, value, and units. It is versioned so history stays queryable after changes. It has clear ownership so someone cares about quality.
With a CDM, thousands of tags turn into a plant model you can query and reuse. Templates let you roll the same model across similar assets, lines, and sites with minor tweaks.
Federation at the edge: local power, enterprise context
Not all data should move. UMDA keeps data where it makes sense and exposes a unified view. Edge nodes ingest with contracts, tag context before publish, buffer during network loss, and make fast decisions locally. Enterprise systems provide richer context from MES, ERP, and CMMS. The Unified Namespace routes events without copies.
The Unified Data Layer: connecting the dots
The UDL is the backbone. It harmonizes streams, preserves lineage, and serves governed access. It supports ingestion from OT and IT, storage in open formats, processing into events, KPIs, and features, and access through SQL, REST, and GraphQL. It also hosts the semantic layer that defines dimensions and measures for self service analytics.
The Feedback Data Layer closes the loop. It logs predictions, confidence, human responses, and outcomes. It provides retraining sets and audit trails. It turns pilots into systems that keep getting better.
Summary. CDM, federation, UDL, and FDL do more than organize data. They create insight, scale, and evolution.
4. The AI enabled factory: A symphony of systems
From automation to orchestration
Rules alone cannot handle today’s variability. UMDA lets systems share context and adapt in real time. Machines, software, and people coordinate as one system. Each part does its job while listening to the rest.
Seeing cause, not just correlation: causal AI
Pattern models find correlations. Causal models test interventions and predict outcomes. They answer what happens if you slow a line, change a profile, or swap a lot. UMDA provides clean history and relationships so causal models can work.
Digital twins and digital threads: mirror and memory
The twin mirrors current state for safe simulation. The thread records full history across batches, materials, and shifts. UMDA feeds both with accurate context so you can test changes and trace outcomes without guesswork.
Autonomous agents and LLM routers
Specialized agents focus on energy, quality, maintenance, and flow. The LLM router sends each task to the right model based on domain, latency, and cost. Low confidence cases go to people with full traceability. Outcomes write back to the Feedback Data Layer to improve the next decision.
Summary. Intelligence becomes part of the infrastructure. UMDA fuses real time signals with history, local action with enterprise goals, and human judgment with machine learning.
5. Preparing for the future: strategic implementation and risk
Local control and enterprise consistency
Sites need autonomy. Enterprises need consistency. UMDA balances both. Local CDMs reflect floor reality and map to enterprise models. Data contracts define shape, units, latency, and availability. Plants move fast without breaking the shared language.
The human architecture
UMDA succeeds when IT, OT, data, operations, and quality plan and own it together. Decide who approves schemas, who manages topics, who promotes models, and how changes roll out across shifts. Shared ownership builds trust and speed.
Trust and explainability
Validate data at ingress. Monitor drift. Log decisions with reasons. Show operators which signals drove a recommendation. Clarity earns trust. Trust drives adoption. Adoption drives value.
Security and compliance by design
Protect identity, transport, and storage. Use role based access, encryption in motion and at rest, network segmentation, signed artifacts, and mapped controls for your standards. Build this in from day one, not as a bolt on later.
Summary. UMDA is a capability you build. It embeds governance, security, and explainability so you can depend on AI with confidence.
6. Your roadmap to an AI enabled enterprise
Start with the problem, not the platform
- Pick a single, high leverage outcome such as cutting unplanned downtime on a critical asset, improving first pass yield, or stabilizing cycle time on a bottleneck.
- Define the metric, the target, the payback, and the owner.
Build a foundation for speed and scale
- Inventory sources, tags, units, and timestamp precision. Fix time sync to UTC as needed.
- Map to a minimal CDM and add required context fields. Publish a data contract for each feed.
- Stand up edge ingestion with validation and context tagging. Publish to the Unified Namespace.
- Land UDL tables in open formats with lineage. Expose access for operators, engineers, and data teams.
Enable intelligence without losing control
- Deliver the first model in Shadow. Move to Recommend with thresholds and audit. Enter Controlled for a small, preapproved action with instant rollback.
- Log predictions, actions, and outcomes in the Feedback Data Layer. Monitor false positives and operator adoption.
- Target quick wins in weeks, not months. Document and template the pattern.
Scale by template
Turn the working CDM, pipelines, topics, and dashboards into a reusable template. Apply it to the next similar asset, line, or site. Keep site autonomy. Keep enterprise consistency.
Summary. The roadmap is simple. Pick a problem. Shape the data. Stand up the path. Ship the first model. Close the loop. Scale the template.
7. The journey ahead
UMDA is a practical way to build a factory that thinks with you. It respects the shop floor, speaks the language of IT, and gives AI the context it needs to help at scale. It turns data into decisions you can trust. It protects current investments and creates a clear path to expand.
If you want the full playbook, the longer work titled Manufacturing AI: Building the Data Foundation for the Next Industrial Revolution goes deeper. It includes design patterns, deployment checklists, and lessons learned that reduce risk and speed results.
What you get in the full text
- Comprehensive implementation frameworks that move pilots to enterprise systems
- Detailed guidance for feedback loops, agentic AI, and cross system orchestration
- Hard won lessons that help you avoid common traps before they get expensive
- Strategies for federated governance, edge to cloud coordination, and high trust human AI collaboration
Manufacturing is changing. And those who win will be the ones who stop chasing trends and start designing systems. This paper may have shown you what’s possible. The book shows you how.
The next industrial revolution has already begun.
The question isn’t if you’ll participate.
It’s how well prepared you’ll be.
And that preparation starts now, with the data beneath your feet.
