SMALLINT data type
A 2-byte integer data type used in CREATE TABLE
and ALTER
TABLE
statements.
Syntax:
In the column definition of a CREATE TABLE
statement:
column_name SMALLINT
Range: -32768 .. 32767. There is no UNSIGNED
subtype.
Conversions: Impala automatically converts to a larger integer type (INT
or
BIGINT
) or a floating-point type (FLOAT
or DOUBLE
)
automatically. Use CAST()
to convert to TINYINT
, STRING
,
or TIMESTAMP
.
Casting an integer or floating-point
value N
to TIMESTAMP
produces a
value that is N
seconds past the start of the epoch
date (January 1, 1970). By default, the result value represents a date
and time in the UTC time zone. If the setting
‑‑use_local_tz_for_unix_timestamp_conversions=true
is in effect, the resulting TIMESTAMP
represents a
date and time in the local time zone.
Usage notes:
For a convenient and automated way to check the bounds of the SMALLINT
type, call the
functions MIN_SMALLINT()
and MAX_SMALLINT()
.
If an integer value is too large to be represented as a SMALLINT
, use an
INT
instead.
NULL considerations: Casting any non-numeric value to this type produces a NULL
value.
Examples:
CREATE TABLE t1 (x SMALLINT); SELECT CAST(1000 AS SMALLINT);
Parquet considerations:
Physically, Parquet files represent TINYINT
and SMALLINT
values as 32-bit
integers. Although Impala rejects attempts to insert out-of-range values into such columns, if you create a
new table with the CREATE TABLE ... LIKE PARQUET
syntax, any TINYINT
or
SMALLINT
columns in the original table turn into INT
columns in the new
table.
Partitioning: Prefer to use this type for a partition key column. Impala can process the numeric
type more efficiently than a STRING
representation of the value.
HBase considerations: This data type is fully compatible with HBase tables.
Text table considerations: Values of this type are potentially larger in text tables than in tables using Parquet or other binary formats.
Internal details: Represented in memory as a 2-byte value.
Added in: Available in all versions of Impala.
Column statistics considerations: Because this type has a fixed size, the maximum and average size
fields are always filled in for column statistics, even before you run the COMPUTE STATS
statement.