Skip to content

Release 61 Pygmy Parrot

Compare
Choose a tag to compare
@alexanderdean alexanderdean released this 02 Mar 13:13
· 2495 commits to master since this release

The major additions are:

  1. You can now parse Amazon CloudFront access logs using Snowplow
  2. The latest Clojure Collector version supports Tomcat 8 and CORS for cross-domain POST from JavaScript and ActionScript
  3. EmrEtlRunner's failure handling and Clojure Collector log handling have been improved

Common

  • Bumped VERSION file to r61-pygmy-parrot
  • Added Gradle to up.playbooks (#1270)
  • Added .travis.yml file and Travis button to repo (#1359)
  • Added Release button to README (#1428)
  • Added License button to README (#1427)

Clojure Collector

  • Bumped to 1.0.0
  • Updated access-valve to depend on Tomcat 8 classes (#1203)
  • Updated .ebextensions to depend on Tomcat 8 (#1202)
  • Added ability to disable third-party cookies (#1362)
  • Added CORS support (#1146)
  • Added CORS-style support for ActionScript3 Tracker (#1330)
  • Added support for /:vendor/:version to HEAD (#1166)
  • Now using UTF-8 for character encoding throughout (#1354)

Scala Common Enrich

  • Bumped to 0.12.0
  • Updated SnowplowAdapter to accept "charset=UTF-8" (#1424)
  • Base64 decoding does not specify UTF-8 charset (#1403)
  • Removed incorrect extra layer of URL decoding from non-Bas64-encoded JSONs (#1396)
  • Added support for ti_nm for transaction item name as well as ti_na (#1401)
  • Added CloudfrontAccessLogAdapter (#1282)
  • Made timestamp field of CollectorPayload an Option (#1417)

Scala Hadoop Enrich

  • Bumped to 0.13.0
  • Bumped Scala Common Enrich to 0.12.0 (#1395)
  • Added test for non-Base64-encoded JSON (#1394)
  • Updated tests to include Unicode (#1390)
  • Added integration test for CloudfrontAccessLogAdapter (#1423)

Scala Hadoop Bad Rows

  • Removed .travis.yml (#1382)

EmrEtlRunner

  • Bumped to 0.12.0
  • Now appending region name to Clojure Collector log files (#1379)
  • Added support for moving and archiving timestamped Clojure Collector log files (#1400)
  • Now appending rather than prepending instance names to Clojure Collector log files (#1404)
  • Changed Clojure Collector log timestamp format to match CloudFront logs (#1398)
  • Added dedicated return code for no files to process (#1397)
  • Now allowing tsv// and json// as :etl:collector_format (#1284)
  • Now performing S3DistCp from processing for tsv/com.amazon.aws.cloudfront/* (#1431)
  • Added output directory empty check prior to staging step (#1151)

StorageLoader

  • Updated shell script to only run StorageLoader if EmrEtlRunner found files (#1399)
  • Wrote JSON Path file for a com.snowplowanalytics.snowplow/flash_context (#1305)
  • Wrote JSON Path file for a com.snowplowanalytics.snowplow/timing event (#1388)
  • Wrote JSON Path file for a com.amazon.aws.cloudfront/wd_access_log event (#1285)
  • Wrote JSON Path file for a com.google.analytics/cookies context (#1409)
  • Wrote JSON Path file for a com.snowplowanalytics.snowplow/desktop_context (#1421)

Redshift

  • Added Redshift DDL for a com.snowplowanalytics.snowplow/timing event (#1387)
  • Added Redshift DDL for a com.snowplowanalytics.snowplow/flash_context (#1304)
  • Added Redshift DDL for a com.amazon.aws.cloudfront/wd_access_log event (#1286)
  • Added Redshift DDL for a com.google.analytics/cookies context (#1408)
  • Added Redshift DDL for a com.snowplowanalytics.snowplow/desktop_context (#1420)