In the previous chapter, we covered structs (13.7 -- Introduction to structs, members, and member selection), and discussed how they are great for bundling multiple member variables into a single object that can be initialized and passed around as a unit. In other words, structs provide a convenient package for storing and moving related data values.
Consider the following struct:
#include <iostream>
struct Date
{
int day{};
int month{};
int year{};
};
void printDate(const Date& date)
{
std::cout << date.day << '/' << date.month << '/' << date.year; // assume DMY format
}
int main()
{
Date date{ 4, 10, 21 }; // initialize using aggregate initialization
printDate(date); // can pass entire struct to function
return 0;
}
In the above example, we create a Date
object and then pass it to a function that prints the date. This program prints:
4/10/21
A reminder
In these tutorials, all of our structs are aggregates. We discuss aggregates in lesson 13.8 -- Struct aggregate initialization.
As useful as structs are, structs have a number of deficiencies that can present challenges when trying to build large, complex programs (especially those worked on by multiple developers).
The class invariant problem
Perhaps the biggest difficulty with structs is that they do not provide an effective way to document and enforce class invariants. In lesson 9.6 -- Assert and static_assert, we defined an invariant as, “a condition that must be true while some component is executing”.
In the context of class types (which include structs, classes, and unions), a class invariant is a condition that must be true throughout the lifetime of an object in order for the object to remain in a valid state. An object that has a violated class invariant is said to be in an invalid state, and unexpected or undefined behavior may result from further use of that object.
Key insight
Using an object whose class invariant has been violated may result in unexpected or undefined behavior.
First, consider the following struct:
struct Pair
{
int first {};
int second {};
};
The first
and second
members can be independently set to any value, so Pair
struct has no invariant.
Now consider the following almost-identical struct:
struct Fraction
{
int numerator { 0 };
int denominator { 1 };
};
We know from mathematics that a fraction with a denominator of 0
is mathematically undefined (because the value of a fraction is its numerator divided by its denominator -- and division by 0
is mathematically undefined). Therefore, we want to ensure the denominator
member of a Fraction object is never set to 0
. If it is, then that Fraction object is in an invalid state, and undefined behavior may result from further use of that object.
For example:
#include <iostream>
struct Fraction
{
int numerator { 0 };
int denominator { 1 }; // class invariant: should never be 0
};
void printFractionValue(const Fraction& f)
{
std::cout << f.numerator / f.denominator << '\n';
}
int main()
{
Fraction f { 5, 0 }; // create a Fraction with a zero denominator
printFractionValue(f); // cause divide by zero error
return 0;
}
In the above example, we use a comment to document the invariant of Fraction. We also provide a default member initializer to ensure that denominator is set to 1
if the user does not provide an initialization value. This ensures our Fraction object will be valid if the user decides to value initialize a Fraction object. That’s an okay start.
But nothing prevents us from explicitly violating this class invariant: When we create Fraction f
, we use aggregate initialization to explicitly initialize the denominator to 0
. While this does not cause an immediate issue, our object is now in an invalid state, and further use of the object may cause unexpected or undefined behavior.
And that is exactly what we see later, when we call printFractionValue(f)
: the program terminates due to a divide-by-zero error.
As an aside…
A small improvement would be to assert(f.denominator != 0);
at the top of the body of printFractionValue
. This adds documentation value to the code, and makes it more obvious what precondition is being violated. However, behaviorally, this doesn’t really change anything. We really want to catch these problems at the source of the problem (when the member is initialized or assigned a bad value), not somewhere downstream (when the bad value is used).
Given the relative simplicitly of the Fraction example, it shouldn’t be too difficult to simply avoid creating invalid Fraction objects. However, in a more complex code base that uses many structs, structs with many members, or structs whose members have complex relationships, understanding what combination of values might violate some class invariant may not be so obvious.
A more complex class invariant
The class invariant for Fraction is a simple one -- the denominator
member cannot be 0
. That’s conceptually easy to understand and not overly difficult to avoid.
Class invariants become more of a challenge when the members of a struct must have correlated values.
#include <string>
struct Employee
{
std::string name { };
char firstInitial { }; // should always hold first character of `name` (or `0`)
};
In the above (poorly designed) struct, the character value stored in member firstInitial
should always match the first character of name
.
When an Employee
object is initialized, the user is responsible for making sure the class invariant is maintained. And if name
is ever assigned a new value, the user is also responsible for ensuring firstInitial
is updated as well. This correlation may not be obvious to a developer using an Employee object, and even if it is, they may forget to do it.
Even if we write functions to help us create and update Employee objects (ensuring that firstInitial
is always set from the first character of name
), we’re still relying on the user to be aware of and use these functions.
In short, relying on the user of an object to maintain class invariants is likely to result in problematic code.
Key insight
Relying on the user of an object to maintain class invariants is likely to result in problems.
Ideally, we’d love to bulletproof our class types so that an object either can’t be put into an invalid state, or can signal immediately if it is (rather than letting undefined behavior occur at some random point in the future).
Structs (as aggregates) just don’t have the mechanics required to solve this problem in an elegant way.
Introduction to classes
When developing C++, Bjarne Stroustrup wanted to introduce capabilities that would allow developers to create program-defined types that could be used more intuitively. He was also interested in finding elegant solutions to some of the frequent pitfalls and maintenance challenges that plague large, complex programs (such as the previously mentioned class invariant issue).
Drawing upon his experience with other programming languages (particularly Simula, the first object-oriented programming language), Bjarne was convinced that it was possible to develop a program-defined type that was general and powerful enough to be used for almost anything. In a nod to Simula, he called this type a class.
Just like structs, a class is a program-defined compound type that can have many member variables with different types.
Key insight
From a technical standpoint, structs and classes are almost identical -- therefore, any example that is implemented using a struct could be implemented using a class, or vice-versa. However, from a practical standpoint, we use structs and classes differently.
We cover both the technical and practical differences between structs and classes in lesson 14.5 -- Public and private members and access specifiers
Related content
We cover how classes solve the invariant problem in lesson 14.8 -- The benefits of data hiding (encapsulation).
Defining a class
Because a class is a program-defined data type, it must be defined before it can be used. Classes are defined similarly to structs, except we use the class
keyword instead of struct
. For example, here is a definition for a simple employee class:
class Employee
{
int m_id {};
int m_age {};
double m_wage {};
};
Related content
We discuss why member variables of a class are often prefixed with an “m_” in upcoming lesson 14.5 -- Public and private members and access specifiers
To demonstrate how similar classes and structs can be, the following program is equivalent to the one we presented at the top of the lesson, but Date
is now a class instead of a struct:
#include <iostream>
class Date // we changed struct to class
{
public: // and added this line, which is called an access specifier
int m_day{}; // and added "m_" prefixes to each of the member names
int m_month{};
int m_year{};
};
void printDate(const Date& date)
{
std::cout << date.m_day << '/' << date.m_month << '/' << date.m_year;
}
int main()
{
Date date{ 4, 10, 21 };
printDate(date);
return 0;
}
This prints:
4/10/21
Related content
We cover what an access specifier is in upcoming lesson 14.5 -- Public and private members and access specifiers.
Most of the C++ standard library is classes
You have already been using class objects, perhaps without knowing it. Both std::string
and std::string_view
are defined as classes. In fact, most of the non-aliased types in the standard library are defined as classes!
Classes are really the heart and soul of C++ -- they are so foundational that C++ was originally named “C with classes”! Once you are familiar with classes, much of your time in C++ will be spent writing, testing, and using them.
Quiz time
Question #1
Given some set of values (ages, address numbers, etc…), we might want to know what the minimum and maximum values are in that set. Since the minimum and maximum values are related, we can organize them in a struct, like so:
struct minMax
{
int min; // holds the minimum value seen so far
int max; // holds the maximum value seen so far
};
However, as written, this struct has an unspecified class invariant. What is the invariant?