Skip to content
This repository has been archived by the owner on Apr 29, 2024. It is now read-only.

Latest commit

 

History

History
73 lines (67 loc) · 1.94 KB

CHANGES.md

File metadata and controls

73 lines (67 loc) · 1.94 KB

Changes in dataduct

0.5.1

  • update to pyyaml so awscli won't puke

0.5.0

  • Cleanup commands being passed in QA steps
  • Add support for postgres
  • Status bar for uploading large files
  • Minor bug fixes

0.4.0

  • Support for starting database shell from dataduct cli
  • Fix bug in logger configuration
  • More performance tuning for analyze and vacumm
  • Improved subject line for SNS messages
  • More informed logging for load errors
  • Improvements to decorators
  • PK enforcement changes
  • New load-reload-pk step
  • Support for worker groups
  • Steps to move away from scripts to all code being contained in the library

0.3.0

  • More documentation
  • Bug fixes in SQL parser
  • Hooks framework
  • Default bootstrap
  • Teardown
  • Frequency fixes

0.2.0

  • Travis integration for continous builds
  • QA steps and logging to S3
  • Visualizing pipeline
  • Dataduct CLI updated as a single entry point
  • RDS connections for scripts
  • Bootstrap step for pipelines
  • Backfill or delay activation
  • Output path and input path options
  • Script directory for transform step
  • SQL sanatization for DBA actions
  • SQL parser for select and create table statements
  • Logging across the library
  • Support for custom steps
  • Pipeline dependency step
  • Reduce verbosity of imports
  • Step parsing is isolated in steps
  • More examples for steps
  • Sync config with S3
  • Config overides with modes
  • Rename keywords and safe config failure handling
  • EMR Streaming support with hadoop 2
  • Exceptions cleanup
  • Read the docs support
  • Creating tables automatically for various steps
  • History table support
  • EC2 and EMR config control from YAML
  • Slack integration
  • Support for Regions in DP

0.1.0

  • Initial version of the dataduct library released
  • Support for the following steps:
    • emr_streaming
    • extract-local
    • extract-s3
    • extract-rds
    • extract-redshift
    • load-redshift
    • sql-command
    • transform
  • Examples and documentation added for all the steps