I’m currently using Redshift and I’m trying to split an 8M-row dataset into multiple buckets of 1M rows each, ordered from the earliest timestamp to the most recent.
For example: if my dataset covers 8 years and the rows are evenly distributed over time, I would expect bucket1 to hold the data from year0 to year1, bucket2 the data from year1 to year2, and so on.
I’m also looking to build a summary table of each bucket’s time range, something like:
BucketX | Start | End |
---|---|---|
Bucket1 | year0 | year1 |
Bucket2 | year1 | year2 |
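
To make the goal concrete, here is a minimal sketch of the result I'm after. It runs against an in-memory SQLite database from Python rather than Redshift, so treat it as an illustration only. The table name `events`, the column `ts`, and the helper `bucket_ranges` are hypothetical, and a bucket size of 3 stands in for 1M. The query numbers rows by timestamp with `ROW_NUMBER()`, uses integer division to assign a bucket, then takes `MIN`/`MAX` per bucket. I believe the same SQL shape should carry over to Redshift, but I have not verified that.

```python
import sqlite3

def bucket_ranges(timestamps, bucket_size):
    """Return (bucket, start_ts, end_ts, row_count) for fixed-size buckets ordered by time."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical table and column names, used only for this illustration.
    conn.execute("CREATE TABLE events (ts TEXT)")
    conn.executemany("INSERT INTO events (ts) VALUES (?)", [(t,) for t in timestamps])
    query = """
        SELECT bucket,
               MIN(ts)  AS start_ts,
               MAX(ts)  AS end_ts,
               COUNT(*) AS row_count
        FROM (
            -- Number rows from earliest to latest, then integer-divide
            -- so that rows 1..N land in bucket 1, N+1..2N in bucket 2, etc.
            SELECT ts,
                   (ROW_NUMBER() OVER (ORDER BY ts) - 1) / ? + 1 AS bucket
            FROM events
        ) AS numbered
        GROUP BY bucket
        ORDER BY bucket
    """
    rows = conn.execute(query, (bucket_size,)).fetchall()
    conn.close()
    return rows

if __name__ == "__main__":
    # Eight yearly timestamps, deliberately out of order.
    sample = ["2019-06-01", "2015-06-01", "2022-06-01", "2017-06-01",
              "2016-06-01", "2021-06-01", "2018-06-01", "2020-06-01"]
    for bucket, start_ts, end_ts, row_count in bucket_ranges(sample, 3):
        print(f"Bucket{bucket} | {start_ts} | {end_ts} | {row_count}")
```

Note that with this approach the last bucket can be smaller than the others when the row count is not a multiple of the bucket size, and each bucket's End is the last timestamp inside it rather than the next bucket's Start.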