Data Engineer
About Toptal
Toptal is one of the most innovative and rapidly expanding tech start-ups from Silicon Valley. With backing from investors such as Andreessen Horowitz and Adam D’Angelo, our platform is the fastest growing labor marketplace in the history of the Internet—connecting thousands of elite engineers and designers all over the world. In the last five years, Toptal has become the #1 choice for tech companies requiring top-tier engineering and design talent and for the top 3% of freelancers looking for their next challenge.
While we’re primarily focused on bringing quality and value to our clients, we’re also committed to creating a world-class environment for our employees. We are a completely distributed company with thousands of core and network team members located all over the world, and we take the best elements of virtual teams and combine them with a support structure that encourages innovation, social interaction, and fun. We take an all-hands-on-deck approach to our work, taking pride in being collaborative, creative, and flexible.
If you aren’t looking for a job because you’re already killing it, we want you.
Position Description
At Toptal, we measure everything and always rely on data to guide all of our initiatives, including both our long-term strategy and our day-to-day operations.
As a Data Engineer, your main goal is to be one step ahead of data scientists and analysts, and support them by providing infrastructure and tools they can use to deliver end-to-end solutions to business problems that can be developed rapidly and maintained easily. This is more than building and maintaining ETL pipelines. We need innovation, creativity and solutions that will have significant impact on our velocity. We, in turn, will give you autonomy and freedom to turn your ideas into reality.
This is a remote position that can be done from anywhere. However, we do things like rent out hotels in Africa or mansions in Thailand, and you will certainly be invited to come work with us.
Responsibilities:
- Build scalable, highly performant infrastructure for delivering clear business insights from a variety of raw data sources.
- Develop batch & real-time analytical solutions, prototypes, and proofs of concept for selected solutions.
- Implement complex analytical projects with a focus on collecting, managing, analyzing, and visualizing data.
- Build frameworks and tools to empower our data scientists and analysts.
- Be in constant communication with team members and other relevant parties and convey results efficiently and clearly.
Requirements:
- Working experience with Python, Pandas. Prior experience with Luigi is a plus.
- Working experience with Scala and Spark is a big plus.
- Familiarity with Google Cloud Platform (e.g. GCS and BigQuery) is a plus.
- Working experience with Ruby and Rails is a plus.
- Familiarity with the basic principles of distributed computing and data modeling.
- Extensive experience with object-oriented design and coding and testing patterns, including experience with engineering software platforms and data infrastructures. Familiarity with functional programming concepts is a plus.
- Outstanding communication and interpersonal skills.
- Be excited about collaborating daily with your team and other groups while working via a distributed model.
- Be eager to help your teammates, share your knowledge with them, and learn from them.
- Be open to receiving constructive feedback.
- You must be a world-class individual contributor to thrive at Toptal.