Download User`s Guide - Oracle Documentation
Transcript
Configuring Oracle SQL Connector for HDFS
oracle.hadoop.exttab.dataPaths
Specifies a comma-separated list of fully qualified
HDFS paths. This property enables you to restrict the
input by using special pattern-matching characters in
the path specification. See Table 2-5. This property is
required for the -createTable and -publish
commands using Data Pump or delimited text files.
The property is ignored for Hive data sources.
For example, to select all files in /data/s2/, and only
the CSV files in /data/s7/, /data/s8/, and /data/
s9/, enter this expression:
/data/s2/,/data/s[7-9]/*.csv
The external table accesses the data contained in all
listed files and all files in listed directories. These files
compose a single data set.
The data set can contain compressed files or
uncompressed files, but not both.
Table 2-5
Pattern-Matching Characters
Character
Description
?
Matches any single character
*
Matches zero or more characters
[abc]
Matches a single character from the
character set {a, b, c}
[a-b]
Matches a single character from the
character range {a...b}. The character a must
be less than or equal to b.
[^a]
Matches a single character that is not from
character set or range {a}. The carat (^) must
immediately follow the left bracket.
\c
Removes any special meaning of character
c. The backslash is the escape character.
{ab\,cd}
Matches a string from the string set {ab, cd}.
Precede the comma with an escape
character (\) to remove the meaning of the
comma as a path separator.
{ab
Matches a string from the string set {ab, cde,
cfh}. Precede the comma with an escape
character (\) to remove the meaning of the
comma as a path separator.
\,c{de
\,fh}}
Oracle SQL Connector for Hadoop Distributed File System 2-37