1. Transfer files from S3 buckets to GCS
  2. Create / Start a CE Instance
  3. Mount extra storage as needed to hold the large Adobe zip files
  4. Copy the files from GCS to compute instance-1. Try one first to test and make sure everything works before copying everything.
  5. Unzip and untar the Adobe files
  6. Create the JSON schema reflecting the field names for "hit_data.tsv" with "column_header.tsv". Follow GBQ schema docs for schema file format.
  7. Load hit data with command: bq load -F "\t" <GBQ DataSet.TableName> <Source Data> <Schema File>

results matching ""

    No results matching ""