The Definitive Guide to DAX: Business intelligence with Microsoft Power BI, SQL Server Analysis Services, and Excel, Second Edition (for pre sume)

Chapter 2
Introducing DAX

In this chapter, we start talking about the DAX language. Here you learn the syntax of the language, the difference between a calculated column and a measure (also called calculated field, in certain old Excel versions), and the most commonly used functions in DAX.

Because this is an introductory chapter, it does not cover many functions in depth. In later chapters, we explain them in more detail. For now, introducing the functions and starting to look at the DAX language in general are enough. When we reference features of the data model in Power BI, Power Pivot, or Analysis Services, we use the term Tabular even when the feature is not present in all the products. For example, “DirectQuery in Tabular” refers to the DirectQuery mode feature available in Power BI and Analysis Services but not in Excel.

Understanding DAX calculations

Before working on more complex formulas, you need to learn the basics of DAX. This includes DAX syntax, the different data types that DAX can handle, the basic operators, and how to refer to columns and tables. These concepts are discussed in the next few sections.

We use DAX to compute values over columns in tables. We can aggregate, calculate, and search for numbers, but in the end, all the calculations involve tables and columns. Thus, the first syntax to learn is how to reference a column in a table.

The general format is to write the table name enclosed in single quotation marks, followed by the column name enclosed in square brackets, as follows:

'Sales'[Quantity]

We can omit the single quotation marks if the table name does not start with a number, does not contain spaces, and is not a reserved word (like Date or Sum).

The table name is also optional in case we are referencing a column or a measure within the table where we define the formula. Thus, [Quantity] is a valid column reference, if written in a calculated column or in a measure defined in the Sales table. Although this option is available, we strongly discourage you from omitting the table name. At this point, we do not explain why this is so important, but the reason will become clear when you read Chapter 5, “Understanding CALCULATE and CALCULATETABLE.” Nevertheless, it is of paramount importance to be able to distinguish between measures (discussed later) and columns when you read DAX code. The de facto standard is to always use the table name in column references and always avoid it in measure references. The earlier you start adopting this standard, the easier your life with DAX will be. Therefore, you should get used to this way of referencing columns and measures:

DAX Data Type	Power BI Data Type	Power Pivot and Analysis Services Data Type	Correspondent Conventional Data Type (e.g., SQL Server)	Tabular Object Model (TOM) Data Type
Integer	Whole Number	Whole Number	Integer / INT	int64
Decimal	Decimal Number	Decimal Number	Floating point / DOUBLE	double
Currency	Fixed Decimal Number	Currency	Currency / MONEY	decimal
DateTime	DateTime, Date, Time	Date	Date / DATETIME	dateTime
Boolean	True/False	True/False	Boolean / BIT	boolean
String	Text	Text	String / NVARCHAR(MAX)	string
Variant	-	-	-	variant
Binary	Binary	Binary	Blob / VARBINARY(MAX)	binary

Operator Type	Symbol	Use	Example
Parenthesis	( )	Precedence order and grouping of arguments	(5 + 2) * 3
Arithmetic	+ − * /	Addition Subtraction/negation Multiplication Division	4 + 2 5 − 3 4 * 2 4 / 2
Comparison	= <> > >= < <=	Equal to Not equal to Greater than Greater than or equal to Less than Less than or equal to	[CountryRegion] = “USA” [CountryRegion] <> “USA” [Quantity] > 0 [Quantity] >= 100 [Quantity] < 0 [Quantity] <= 100
Text concatenation	&	Concatenation of strings	“Value is” & [Amount]
Logical	&& \|\| IN NOT	AND condition between two Boolean expressions OR condition between two Boolean expressions Inclusion of an element in a list Boolean negation	[CountryRegion] = “USA” && [Quantity]>0 [CountryRegion] = “USA” \|\| [Quantity] > 0 [CountryRegion] IN {“USA”, “Canada”} NOT [Quantity] > 0

Table	Expanded Version
Date	Date
Sales	All the tables in the entire model
Product	Product, Product Subcategory, Product Category
Product Subcategory	Product Subcategory, Product Category
Product Category	Product Category

Order Date	Delivery Date	Quantity	Date
12/31/2007	01/07/2008	100	01/07/2008
01/05/2008	01/10/2008	200	01/10/2008

Function	Table function	CALCULATE modifier
ALL	Returns all the distinct values of a column or of a table.	Removes any filter from columns or expanded tables. It never adds a filter; it only removes them if present.
ALLEXCEPT	Returns all the distinct values of a table, ignoring filters on some of the columns of the expanded table.	Removes filters from an expanded table, except from the columns (or tables) passed as further arguments.
ALLNOBLANKROW	Returns all the distinct values of a column or table, ignoring the blank row added for invalid relationships.	Removes any filter from columns or expanded tables; also adds a filter that only removes the blank row. Thus, even if there are no filters, it actively adds one filter to the context.
ALLSELECTED	Returns the distinct values of a column or a table, as they are visible in the last shadow filter context.	Restores the last shadow filter context on tables or columns, if a shadow filter context is present. Otherwise, it does not do anything. It always adds filters, even in the case where the filter shows all the values.
ALLCROSSFILTERED	Not available as a table function.	Removes any filter from an expanded table, including also the tables that can be reached directly or indirectly through bidirectional cross-filters. ALLCROSSFILTERED never adds a filter; it only removes filters if present.

Type of Relationship	Cross-filter Direction	Filter Context Propagation	Weak / Strong Type
SMR	Single	From the one side to the many side	Weak if cross-island, strong otherwise
SMR	Both	Bidirectional	Weak if cross-island, strong otherwise
SSR	Both	Bidirectional	Weak if cross-island, strong otherwise
MMR	Single	Must choose the source table	Always weak
MMR	Both	Bidirectional	Always weak

Access	Access Time	Human Metrics
1 CPU cycle	0.3 ns	1 s
L1 cache	0.9 ns	3 s
L2 cache	2.8 ns	9 s
L3 cache	12.9 ns	43 s
RAM access	120 ns	6 min
Solid-state disk I/O	50–150 µs s	2–6 days
Rotational disk I/O	1–10 ms	1–12 months

Object	Information to Collect
Table	Number of rows
Column	Number of unique values Size of dictionary Size of data (total size of all segments)
Hierarchy	Size of hierarchy structure
Relationship	Size of relationship structure

Date	Amount	Payment Type Code	Payment Type Description
2015-06-21	100	00	Cash
2015-06-21	100	02	Credit Card
2015-06-22	200	02	Credit Card
2015-06-23	200	00	Cash
2015-06-23	100	03	Wire Transfer
2015-06-24	200	02	Credit Card
2015-06-25	100	00	Cash

Precision	Cardinality
Hour	24
15 Minutes	96
5 Minutes	288
Minute	1,440
Second	86,400
Millisecond	86,400,000

Expression	Result
10 / 0	Infinity
7 / 0	Infinity
0 / 0	NaN
(10 / 0) / (7 / 0)	NaN

Query Request	Aggregation Used
Group by product brand and year	Product and Date
Group by product brand and month	Product and Date
Group by store country and year	Store and Date
Group by store country and month	Store and Date
Group by year	Product and Date (highest precedence)
Group by month	Product and Date (highest precedence)
Group by store country and product brand	No aggregation—query Sales table at detail level

Column	Memory (MB)	Distinct Values	SUM (ms)	DISTINCTCOUNT (ms)
Date	0.03	1,588	9	20
Age	165.26	96	146	333
Score	2,648.40	9,766,664	837	4,288
Time	6,493.57	1,439	1,330	4,102

Line	Subclass	Duration	CPU	Query
1	Internal	4,269	31,641	SELECT Example[Score] FROM Example;
2	Internal	4,269	31,641	SELECT Example[Score] FROM Example;
3	Internal	19	31,766	SELECT COUNT( ) FROM $DCOUNT_DATACACHE;
4	Scan	4,288	31,766	SELECT DCOUNT ( Example[Score] ) FROM Example;

Line	Subclass	Duration	CPU	Query
1	Internal	1,796	13,516	SELECT Example[Date], SUM ( Example[Amt] ), SUM ( Example[Qty] ), COUNT ( ) FROM Example;
2	Scan	1,796	13,516	SELECT Example[Date], SUM ( Example[Amt] ), SUM ( Example[Qty] ), COUNT ( ) FROM Example;
3	Internal	6	31	SELECT Example[Date], COUNT ( ) FROM Example;
4	Scan	6	31	SELECT Example[Date] FROM Example;

Line	Subclass	Query
1	Cache	SELECT Example[Date], SUM ( Example[Amt] ), SUM ( Example[Qty] ), COUNT ( ) FROM Example;
2	Scan	SELECT Example[Date], SUM ( Example[Amt] ), SUM ( Example[Qty] ), COUNT ( ) FROM Example;
3	Cache	SELECT Example[Date], COUNT ( ) FROM Example;
4	Scan	SELECT Example[Date] FROM Example;

Line	Subclass	Duration	CPU	Rows	Query
1	Internal	8,379	64,234	1	WITH $Expr0 := [CallbackDataID ( IF ( Example[Denominator] <> 0, ...
2	Scan	8,379	64,234	1	WITH $Expr0 := [CallbackDataID ( IF ( Example[Denominator] <> 0, ...

Line	Subclass	Duration	CPU	Rows	Query
1	Internal	6,790	51,984	1	WITH $Expr0 := [CallbackDataID ( IF ( Example[Denominator] <> 0, ...
2	Scan	6,790	51,984	1	WITH $Expr0 := [CallbackDataID ( IF ( Example[Denominator] <> 0, ...

Line	Subclass	Duration	CPU	Rows	Query
1	Internal	3,108	23,859	1	WITH $Expr0 := Example[Numerator] / Example[Denominator], ...
2	Scan	3,108	23,859	1	WITH $Expr0 := Example[Numerator] / Example[Denominator], ...

The Definitive Guide to DAX: Business intelligence with Microsoft Power BI, SQL Server Analysis Services, and Excel

Contents at a Glance

Contents

Foreword

Acknowledgments

Errata, updates, and book support

Stay in touch

Introduction to the second edition

Introduction to the first edition

Who this book is for

Assumptions about you

Organization of this book

Conventions

About the companion content

Chapter 1What is DAX?

Understanding the data model

Understanding the direction of a relationship

DAX for Excel users

Cells versus tables

Excel and DAX: Two functional languages

Iterators in DAX

DAX requires theory

DAX for SQL developers

Relationship handling

DAX is a functional language

DAX as a programming and querying language

Subqueries and conditions in DAX and SQL

DAX for MDX developers

Multidimensional versus Tabular

DAX as a programming and querying language

Hierarchies

Leaf-level calculations

DAX for Power BI users

Chapter 2Introducing DAX

Understanding DAX calculations

DAX data types

Integer

Decimal

Currency

DateTime

Boolean

String

Variant

Binary

DAX operators

Table constructors

Conditional statements

Understanding calculated columns and measures

Calculated columns

Measures

Choosing between calculated columns and measures

Introducing variables

Handling errors in DAX expressions

Conversion errors

Arithmetic operations errors

Empty or missing values

Intercepting errors

Generating errors

Formatting DAX code

Introducing aggregators and iterators

Using common DAX functions

Aggregation functions

Logical functions

Information functions

Mathematical functions

Trigonometric functions

Text functions

Conversion functions

Date and time functions

Relational functions

Conclusions

Chapter 3Using basic table functions

Introducing table functions

Introducing EVALUATE syntax

Understanding FILTER

Introducing ALL and ALLEXCEPT

Understanding VALUES, DISTINCT, and the blank row

Using tables as scalar values

Introducing ALLSELECTED

Conclusions

Chapter 1
What is DAX?

Chapter 2
Introducing DAX

Chapter 3
Using basic table functions

Chapter 4
Understanding evaluation contexts

Chapter 5
Understanding CALCULATE and CALCULATETABLE

Chapter 6
Variables

Chapter 7
Working with iterators and with CALCULATE

Chapter 8
Time intelligence calculations