Blog

Rightsizing Data for Athena

Amazon Athena is a service that lets you run SQL queries against structured data files stored in S3. It takes a “divide and conquer” approach, spinning up parallel query execution engines that each examine only a portion of your data. The performance of these queries, however, depends on how you consolidate and partition your data. In this post I compare query times for a moderately large dataset, looking for the “sweet spot” between number of files and individual file size.

15 Minutes With: Keith Gregory on Building a Data Pipeline for Better Business Operations

Clickstream data – the behavior data collected from a user’s path through a website or app – is often used for business intelligence reports. It helps many companies answer questions like, ‘which of my products are people adding to their cart?’ or ‘What does our online purchase funnel look like?’ But our AWS Practice Lead, Keith Gregory, argues that this data can also be used to make business operations smoother, or introduce entirely new procedures. In this interview, Keith breaks…

Building Lambdas with Poetry

Coming from a Java background, I consider the Python development process to be a bit of a mess. The pieces are all there: a central repository for publicly-available packages, a way to install the packages you want, and several ways to run your program with only those packages. But it seems that everybody has a different way to combine those pieces. So when a colleague introduced me to Poetry, my first reaction was “oh great, another tool that solves part of my problem.” But after spending time with it, I don’t want to build Lambdas any other way.

Design Thinking Crash Course: Journal My Health

Design Thinking Design Thinking is a popular design methodology that cycles through five stages to provide solutions to complex and unknown problems. At a high level, following the system will enable you to achieve the best solutions by understanding your users’ needs, redefining the problems in a human-centered perspective, thinking of lots of possible solutions, creating an approach to easily test the ideas, and then testing them. The kicker is this can be a linear process, but it probably shouldn’t…

How can we help your company with your development needs?

Contact Us