Overview - Data quality flags

What is it?

Data quality flags are simple markers attached to data points in SCADA systems to show if the data is good, questionable, or bad. They help operators and automated systems quickly understand if the data can be trusted or needs attention. These flags are usually codes or symbols that describe the condition or reliability of the data. They make monitoring and decision-making safer and more efficient.

Why it matters

Without data quality flags, operators might trust faulty or missing data, leading to wrong decisions that can cause equipment damage, safety hazards, or production losses. Flags help catch problems early by signaling when data is unreliable. This prevents costly mistakes and improves system safety and performance. In short, they make sure the data driving important actions is trustworthy.

Where it fits

Before learning about data quality flags, you should understand basic SCADA data collection and sensor operation. After this, you can explore data validation techniques and alarm management. Data quality flags fit into the data monitoring and control part of SCADA systems.

Mental Model

Core Idea

Data quality flags are simple signals attached to data points that tell you if the data is trustworthy or needs attention.

Think of it like...

It's like traffic lights for data: green means go (data is good), yellow means caution (data might be off), and red means stop (data is bad or missing).

┌───────────────┐
│   Data Point  │
├───────────────┤
│ Value: 123.45 │
│ Flag: Green   │
└───────────────┘

Flags meaning:
Green  = Good data
Yellow = Questionable data
Red    = Bad or missing data

Build-Up - 6 Steps

1

FoundationWhat are data quality flags

Concept: Introduce the basic idea of marking data with quality indicators.

In SCADA systems, every data point collected from sensors can have a small code called a data quality flag. This flag tells if the data is good, suspect, or bad. For example, a temperature sensor reading might have a flag that says 'good' if the sensor works well, or 'bad' if the sensor failed.

Result

You understand that data quality flags are simple labels attached to data points to show their trustworthiness.

Knowing that data can have quality labels helps you realize that not all data is equally reliable, which is key for safe control.

2

FoundationCommon types of quality flags

3

IntermediateHow flags are set automatically

4

IntermediateUsing flags in alarms and control

5

AdvancedCustomizing flags for complex systems

6

ExpertChallenges and pitfalls with data flags

Under the Hood

Data quality flags are stored as metadata alongside each data point in the SCADA database or historian. When data arrives, the SCADA software runs validation checks like range tests, sensor status, and communication health. Based on these checks, it assigns a flag code. Flags propagate through the system to displays, alarms, and control logic. Internally, flags are often simple integers or bit fields representing different conditions.

Why designed this way?

Flags were created to provide a lightweight, standardized way to communicate data trustworthiness without changing the data itself. Early SCADA systems needed a simple method to mark data issues that operators could see quickly. Alternatives like separate error logs were too slow or complex. Flags balance simplicity, speed, and clarity.

┌───────────────┐      ┌───────────────┐      ┌───────────────┐
│ Sensor Data   │─────▶│ Validation    │─────▶│ Data + Flag   │
│ (raw value)   │      │ Checks        │      │ Stored in DB  │
└───────────────┘      └───────────────┘      └───────────────┘
                                │
                                ▼
                      ┌─────────────────┐
                      │ Flag Assignment │
                      └─────────────────┘

Myth Busters - 4 Common Misconceptions

Quick: do you think a 'Suspect' flag means the data is definitely wrong? Commit yes or no.

Common Belief:A 'Suspect' flag means the data is bad and should be ignored.

Tap to reveal reality

Quick: do you think data quality flags fix sensor problems automatically? Commit yes or no.

Common Belief:Flags fix or correct bad data automatically.

Tap to reveal reality

Quick: do you think all SCADA systems use the same flag codes? Commit yes or no.

Common Belief:All SCADA systems use the same standard data quality flags.

Tap to reveal reality

Quick: do you think more flags always improve data quality understanding? Commit yes or no.

Common Belief:Adding more detailed flags always helps operators understand data better.

Tap to reveal reality

Expert Zone

1

Some flags are set based on sensor health trends, not just single data points, to catch slow failures.

2

Flags can be combined using bitwise operations to represent multiple simultaneous issues efficiently.

3

Operators often develop personal heuristics for interpreting flags beyond official definitions.

When NOT to use

Data quality flags are less useful in systems with very low data volume or where manual data validation is feasible. In such cases, direct sensor diagnostics or manual checks may be better. Also, in systems requiring real-time control with zero latency, flag processing delays might be unacceptable.

Production Patterns

In production SCADA systems, flags are integrated with alarm management to suppress false alarms and prioritize operator attention. They are also used in historical data analysis to filter out bad data before reporting or machine learning. Large plants customize flags per sensor type and use dashboards to visualize flag trends over time.

Connections

Error detection and correction codes

Both mark data issues but error codes fix data while flags only mark quality.

Understanding flags helps grasp how systems signal data problems without altering the data itself, unlike error correction.

Traffic light signaling systems

Flags and traffic lights both use simple color codes to communicate status quickly.

Recognizing this pattern shows how simple signals can guide complex decisions safely and efficiently.

Quality control in manufacturing

Both use markers to indicate if a product or data meets standards or needs review.

Knowing this connection highlights how quality flags are part of a broader practice of ensuring reliability in processes.

Common Pitfalls

#1Ignoring 'Suspect' flags and treating all data as good.

Wrong approach:if (data.flag == 'Good' || data.flag == 'Suspect') { process(data.value); }

Correct approach:if (data.flag == 'Good') { process(data.value); } else { alert('Check data quality'); }

Root cause:Misunderstanding that 'Suspect' data may be unreliable and should be reviewed before use.

#2Setting flags manually without automated checks.

Wrong approach:// Operator sets flags manually data.flag = 'Good'; // even if sensor failed

Correct approach:// System sets flags automatically if (sensor.status == 'OK') { data.flag = 'Good'; } else { data.flag = 'Bad'; }

Root cause:Relying on manual flagging leads to delays and errors in detecting data problems.

#3Using too many detailed flags confusing operators.

Wrong approach:flags = ['Good', 'Suspect', 'Bad', 'Calibration Needed', 'Communication Delay', 'Sensor Drift', 'Out of Range', 'Power Failure'];

Correct approach:flags = ['Good', 'Suspect', 'Bad']; // Keep it simple for clarity

Root cause:Overcomplicating flags reduces operator ability to quickly understand data quality.

Key Takeaways

Data quality flags are simple markers that show if SCADA data is trustworthy or needs attention.

They help operators and systems avoid mistakes by signaling when data is good, suspect, or bad.

Flags are usually set automatically by the system based on checks like sensor status and data range.

Proper use of flags improves safety, reliability, and efficiency in monitoring and control.

However, flags must be designed carefully to avoid confusion, false alarms, or ignored problems.