This section describes the fields required to configure an Amazon S3 data source in Data Integrity Suite.
| Field | Description |
|---|---|
| Connection name | Specifies the name of the data connection. This is a mandatory field. Example: AWS S3 DataConnection |
| Description | Allows you to describe the purpose of this data connection. Example: Connection to the S3 database for cataloging sales tables. |
| Agent to access data | Select the agent that connects to the database from the dropdown. The agent helps you securely access data from your network. |
| Host name | Specifies the web address or endpoint used to connect to the S3 service. Example: s3.amazonaws.com or s3.us-west-2.amazonaws.com |
| Region | Specifies the geographical area where your S3 buckets are hosted. Example: us-east-1, us-west-2 or eu-west-1 |
| Advanced Options | For advanced options, refer to the debugging parameters listed in the table below. |
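
The Host name examples above follow a pattern that ties the endpoint to the Region value. As a minimal sketch, assuming the standard `s3.<region>.amazonaws.com` naming shown in those examples, a hypothetical helper (`s3_endpoint` is not part of Data Integrity Suite) could derive one field from the other:

```python
from typing import Optional


def s3_endpoint(region: Optional[str] = None) -> str:
    """Return an S3 host name for a region (hypothetical helper).

    Assumes the naming pattern used in the Host name examples:
    no region gives the global endpoint, otherwise a regional one.
    """
    if not region:
        return "s3.amazonaws.com"
    return f"s3.{region}.amazonaws.com"


print(s3_endpoint())             # global endpoint
print(s3_endpoint("us-west-2"))  # regional endpoint
```

Keeping the host name and the region consistent with each other avoids pointing the connection at an endpoint in a different region than the one where the buckets are hosted.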
Debugging parameters
Advanced options include several debugging parameters that allow you to customize how your CSV and JSON files are processed in Amazon S3. These settings control aspects such as:
- Field delimiters
- Header row usage
- Multiline record parsing
- Encoding types
Proper configuration of these parameters helps ensure your data is read and cataloged accurately.
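
To illustrate why the delimiter, header, and quote settings matter, the following sketch parses the same CSV text with different settings. It uses Python's standard `csv` module purely as a stand-in; it is not how Data Integrity Suite reads files. The function `read_csv_sample` and the generated column names (`col0`, `col1`) are assumptions made for the example, and unlike the product setting, Python's reader accepts only a single-character delimiter.

```python
import csv
import io


def read_csv_sample(text, delimiter=",", header=True, quote='"'):
    """Parse CSV text and return (column_names, rows).

    Hypothetical helper that mirrors the idea of the delimiter,
    header, and quote settings; not the product's implementation.
    """
    reader = csv.reader(io.StringIO(text), delimiter=delimiter, quotechar=quote)
    rows = list(reader)
    if header and rows:
        # The first line supplies the column names.
        return rows[0], rows[1:]
    # Without a header, invent positional names (an assumption).
    width = len(rows[0]) if rows else 0
    return [f"col{i}" for i in range(width)], rows


sample = 'id;name\n1;"Smith; Jane"\n'

# Correct delimiter: two columns, and the quoted value keeps its semicolon.
columns, rows = read_csv_sample(sample, delimiter=";")
print(columns, rows)

# Wrong delimiter: every line collapses into a single column.
columns_bad, rows_bad = read_csv_sample(sample, delimiter=",")
print(columns_bad, rows_bad)
```

With the wrong delimiter, the file still parses without an error, but it is cataloged as a single column, which is why checking these settings against a sample of the data is worthwhile.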
| Parameter | Default | Description |
|---|---|---|
| csv.catalog.delimiter | , | Sets the separator between fields and values. The separator can be one or more characters. |
| csv.catalog.header | true | For CSV files, uses the first line as the column names. |
| json.catalog.multiLine | false (for .json files, the default is true; for .jsonl files, the default is false) | Parses one record, which may span multiple lines, per file. Set to true if a record spans multiple lines. Set to false if each record fits on a single line. |
| json.catalog.encoding | UTF-8 | When reading JSON files, forces the use of one of the standard basic or extended encodings. |
| csv.catalog.encoding | UTF-8 | Decodes the CSV files using the given encoding type. |
| csv.catalog.quote | " | Sets a single character used for escaping quoted values where the separator can be part of the value. |
| csv.catalog.linesep | \n | Defines the line separator used for parsing and writing. |
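
The difference between the two json.catalog.multiLine values can be sketched with Python's standard `json` module. This is an illustration of the concept only, not the product's reader; `read_json_records` and its `multi_line` argument are hypothetical names chosen for the example.

```python
import json


def read_json_records(text, multi_line=False):
    """Return a list of records parsed from JSON text.

    multi_line=True  : the whole text is one JSON document that may
                       span several lines (typical for .json files).
    multi_line=False : each non-empty line is a separate record
                       (typical for .jsonl files).
    """
    if multi_line:
        data = json.loads(text)
        return data if isinstance(data, list) else [data]
    return [json.loads(line) for line in text.splitlines() if line.strip()]


jsonl_text = '{"id": 1}\n{"id": 2}\n'
pretty_text = '{\n  "id": 1,\n  "name": "a"\n}\n'

print(read_json_records(jsonl_text))                    # one record per line
print(read_json_records(pretty_text, multi_line=True))  # one record over several lines

# Reading a multi-line document line by line fails, because no
# single line is valid JSON on its own.
try:
    read_json_records(pretty_text, multi_line=False)
except json.JSONDecodeError as err:
    print("line-by-line parse failed:", err.msg)
```

This is the reason the defaults differ by file extension: a pretty-printed .json file needs multi-line parsing, while a .jsonl file already holds one complete record on each line.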