Rechercher dans le manuel MySQL

8.2.1.9 Outer Join Simplification

Table expressions in the FROM clause of a query are simplified in many cases.

At the parser stage, queries with right outer join operations are converted to equivalent queries containing only left join operations. In the general case, the conversion is performed such that this right join:

  1. (T1, ...) RIGHT JOIN (T2, ...) ON P(T1, ..., T2, ...)

Becomes this equivalent left join:

  1. (T2, ...) LEFT JOIN (T1, ...) ON P(T1, ..., T2, ...)

All inner join expressions of the form T1 INNER JOIN T2 ON P(T1,T2) are replaced by the list T1,T2, P(T1,T2) being joined as a conjunct to the WHERE condition (or to the join condition of the embedding join, if there is any).

When the optimizer evaluates plans for outer join operations, it takes into consideration only plans where, for each such operation, the outer tables are accessed before the inner tables. The optimizer choices are limited because only such plans enable outer joins to be executed using the nested-loop algorithm.

Consider a query of this form, where R(T2) greatly narrows the number of matching rows from table T2:

  1. SELECT * T1 LEFT JOIN T2 ON P1(T1,T2)
  2.   WHERE P(T1,T2) AND R(T2)

If the query is executed as written, the optimizer has no choice but to access the less-restricted table T1 before the more-restricted table T2, which may produce a very inefficient execution plan.

Instead, MySQL converts the query to a query with no outer join operation if the WHERE condition is null-rejected. (That is, it converts the outer join to an inner join.) A condition is said to be null-rejected for an outer join operation if it evaluates to FALSE or UNKNOWN for any NULL-complemented row generated for the operation.

Thus, for this outer join:

  1. T1 LEFT JOIN T2 ON T1.A=T2.A

Conditions such as these are null-rejected because they cannot be true for any NULL-complemented row (with T2 columns set to NULL):

  1. T2.B > 3
  2. T2.C <= T1.C
  3. T2.B < 2 OR T2.C > 1

Conditions such as these are not null-rejected because they might be true for a NULL-complemented row:

  1. T2.B IS NULL
  2. T1.B < 3 OR T2.B IS NOT NULL
  3. T1.B < 3 OR T2.B > 3

The general rules for checking whether a condition is null-rejected for an outer join operation are simple:

  • It is of the form A IS NOT NULL, where A is an attribute of any of the inner tables

  • It is a predicate containing a reference to an inner table that evaluates to UNKNOWN when one of its arguments is NULL

  • It is a conjunction containing a null-rejected condition as a conjunct

  • It is a disjunction of null-rejected conditions

A condition can be null-rejected for one outer join operation in a query and not null-rejected for another. In this query, the WHERE condition is null-rejected for the second outer join operation but is not null-rejected for the first one:

  1. SELECT * FROM T1 LEFT JOIN T2 ON T2.A=T1.A
  2.                  LEFT JOIN T3 ON T3.B=T1.B
  3.   WHERE T3.C > 0

If the WHERE condition is null-rejected for an outer join operation in a query, the outer join operation is replaced by an inner join operation.

For example, in the preceding query, the second outer join is null-rejected and can be replaced by an inner join:

  1. SELECT * FROM T1 LEFT JOIN T2 ON T2.A=T1.A
  2.                  INNER JOIN T3 ON T3.B=T1.B
  3.   WHERE T3.C > 0

For the original query, the optimizer evaluates only plans compatible with the single table-access order T1,T2,T3. For the rewritten query, it additionally considers the access order T3,T1,T2.

A conversion of one outer join operation may trigger a conversion of another. Thus, the query:

  1. SELECT * FROM T1 LEFT JOIN T2 ON T2.A=T1.A
  2.                  LEFT JOIN T3 ON T3.B=T2.B
  3.   WHERE T3.C > 0

Is first converted to the query:

  1. SELECT * FROM T1 LEFT JOIN T2 ON T2.A=T1.A
  2.                  INNER JOIN T3 ON T3.B=T2.B
  3.   WHERE T3.C > 0

Which is equivalent to the query:

  1. SELECT * FROM (T1 LEFT JOIN T2 ON T2.A=T1.A), T3
  2.   WHERE T3.C > 0 AND T3.B=T2.B

The remaining outer join operation can also be replaced by an inner join because the condition T3.B=T2.B is null-rejected. This results in a query with no outer joins at all:

  1. SELECT * FROM (T1 INNER JOIN T2 ON T2.A=T1.A), T3
  2.   WHERE T3.C > 0 AND T3.B=T2.B

Sometimes the optimizer succeeds in replacing an embedded outer join operation, but cannot convert the embedding outer join. The following query:

  1.               (T2 LEFT JOIN T3 ON T3.B=T2.B)
  2.               ON T2.A=T1.A
  3.   WHERE T3.C > 0

Is converted to:

  1.               (T2 INNER JOIN T3 ON T3.B=T2.B)
  2.               ON T2.A=T1.A
  3.   WHERE T3.C > 0

That can be rewritten only to the form still containing the embedding outer join operation:

  1.               (T2,T3)
  2.               ON (T2.A=T1.A AND T3.B=T2.B)
  3.   WHERE T3.C > 0

Any attempt to convert an embedded outer join operation in a query must take into account the join condition for the embedding outer join together with the WHERE condition. In this query, the WHERE condition is not null-rejected for the embedded outer join, but the join condition of the embedding outer join T2.A=T1.A AND T3.C=T1.C is null-rejected:

  1.               (T2 LEFT JOIN T3 ON T3.B=T2.B)
  2.               ON T2.A=T1.A AND T3.C=T1.C
  3.   WHERE T3.D > 0 OR T1.D > 0

Consequently, the query can be converted to:

  1.               (T2, T3)
  2.               ON T2.A=T1.A AND T3.C=T1.C AND T3.B=T2.B
  3.   WHERE T3.D > 0 OR T1.D > 0

Find a PHP function

Document created the 26/06/2006, last modified the 26/10/2018
Source of the printed document:https://www.gaudry.be/en/mysql-rf-outer-join-simplification.html

The infobrol is a personal site whose content is my sole responsibility. The text is available under CreativeCommons license (BY-NC-SA). More info on the terms of use and the author.

References

  1. View the html document Language of the document:en Manuel MySQL : https://dev.mysql.com/

These references and links indicate documents consulted during the writing of this page, or which may provide additional information, but the authors of these sources can not be held responsible for the content of this page.
The author This site is solely responsible for the way in which the various concepts, and the freedoms that are taken with the reference works, are presented here. Remember that you must cross multiple source information to reduce the risk of errors.

Contents Haut