answered By vishnoiprem 0

The data distribution of table among AMPs is called Skew Factor .

Generally For Non-Unique PI we get duplicate values so the more duplicate vales we get more the data have same row hash so all the same data will come to same amp, it makes data distribution inequality,One amp will store more data and other amp stores less amount of data, when we are accessing full table,

The amp which is having more data will take longer time and makes other amps waiting which leads processing wastage In this situation (unequal distribution of data)we get Skew Factor HighFor this type of tables we should avoid full table scans.

ex:AMP0 AMP110000(10%) 9000000(90%)in this situation skew factor is very high 90%

Generally For Non-Unique PI we get duplicate values so the more duplicate vales we get more the data have same row hash so all the same data will come to same amp, it makes data distribution inequality,One amp will store more data and other amp stores less amount of data, when we are accessing full table,

The amp which is having more data will take longer time and makes other amps waiting which leads processing wastage In this situation (unequal distribution of data)we get Skew Factor HighFor this type of tables we should avoid full table scans.

ex:AMP0 AMP110000(10%) 9000000(90%)in this situation skew factor is very high 90%

```
Query to find skew factor of a particular table?
SELECT
TABLENAME
,SUM(CURRENTPERM) /1024/1024 AS CURRENTPERM,
(100 - (AVG(CURRENTPERM)/MAX(CURRENTPERM)*100)) AS SKEWFACTOR
FROM
DBC.TABLESIZE
WHERE DATABASENAME= <DATABASENAME>
AND
TABLENAME =<TABLENAME>
GROUP BY 1;
```

flag

Write A Tutorials

Question tags

Similar questions

Follow this question

Follow teradata category

Question tags

teradata
× 1

Asked
**2 years and 3 months ago ago**

Number of Views -1932

Number of Answers -1

Last updated

**3 years and 2 months ago ago**

Number of Views -1932

Number of Answers -1

Last updated

Similar questions