MODEL Clause: PARTITON BY

In this tutorial we explore the PARTITION component of the MODEL clause. The Oracle® Database Data Warehousing Guide 10g Release 2 (10.2) - Chapter 22 describes partition columns this way.

"Partition columns define the logical blocks of the result set in a way similar to the partitions of the analytical functions described in Chapter 21, "SQL for Analysis and Reporting". Rules in the MODEL clause are applied to each partition independent of other partitions."

"You can partition data and evaluate rules within each partition independent of other partitions. This enables parallelization of model computation based on partitions."

Performance

If your system has multiple processors the PARTITION BY clause may improve your query's performance (see Parallel Execution). For example, with a query like this

break on group_1 skip 1 duplicates

select
  group_1 ,
  group_2 ,
  num_val ,
  m_1
from
  t
model
  dimension by ( group_1, group_2 )
  measures     ( num_val, 0 m_1 )
  rules        ( m_1[any, any] = num_val[cv(), cv()] * 10 )
order by
  group_1 ,
  group_2
;

GROUP_1    GROUP_2    NUM_VAL        M_1
---------- ---------- ------- ----------
A          a1             100       1000
A          a2             200       2000
A          a3             300       3000

B          a1
B          a2             300       3000
B          a3             100       1000

C          a1             100       1000
C          a2

           a1             200       2000
           a2             800       8000

one CPU will most likely process the M_1 rule for all rows in the result set (note: I have not tested this behaviour myself; this information is assumed from the manuals). Using a query like the following produces the same output while allowing one processor on your system to process the rows where GROUP_1 is 'A', another CPU to process those where GROUP_1 is 'B', etc.

select
  group_1 ,
  group_2 ,
  num_val ,
  m_1
from
  t
model
  PARTITION BY ( GROUP_1 )
  dimension by ( group_2 )
  measures     ( num_val, 0 m_1 )
  rules        ( m_1[any] = num_val[cv()] * 10 )  -- note only one dimension now
order by
  group_1 ,
  group_2
;

GROUP_1    GROUP_2    NUM_VAL        M_1
---------- ---------- ------- ----------
A          a1             100       1000
A          a2             200       2000
A          a3             300       3000

B          a1
B          a2             300       3000
B          a3             100       1000

C          a1             100       1000
C          a2

           a1             200       2000
           a2             800       8000

While PARTITION BY had no visible effect on the output in the last example, there are situations where it will have an effect. These are described next.

Uniqueness of Dimension Columns

In the DIMENSION BY tutorial we saw that a DIMENSION BY clause whose columns do not uniquely identify each result row generate an error.

select
  group_2 ,
  num_val
from
  t
model
  dimension by ( group_2 ) -- group_2 is not unique
  measures     ( num_val )
  rules        ()
order by
  group_2
;
  t
  *
ERROR at line 5:
ORA-32638: Non unique addressing in MODEL dimensions

With a partition clause this no longer happens, as long as the dimension values are unique within a partition.

select
  group_2 ,
  num_val
from
  t
model
  PARTITION BY ( GROUP_1 )
  dimension by ( group_2 ) -- group_2 values are unique within group_1 values
  measures     ( num_val )
  rules        ()
order by
  group_2
;

GROUP_2    NUM_VAL
---------- -------
a1             100
a1             100
a1
a1             200
a2             800
a2             200
a2
a2             300
a3             100
a3             300

If your dimension values are not unique within a partition you will get an error.

select
  group_1 ,
  date_val ,
  num_val
from
  t
model
  partition by ( group_1 )
  dimension by ( DATE_VAL ) -- there are duplicate date values in group_1
  measures     ( num_val )
  rules        ()
order by
  group_1 ,
  date_val
;
  t
  *
ERROR at line 6:
ORA-32638: Non unique addressing in MODEL dimensions

Aggregate and Analytic Functions

PARTITION BY also effects query results with aggregate and analytic functions. This effect is described in the Aggregate and Analytic Expressions tutorial.

SQL Snippets ™: Tutorials for Oracle Developers

MODEL Clause

PARTITON BY

Performance

Uniqueness of Dimension Columns

Aggregate and Analytic Functions

Linking to SQL Snippets ™