Data Architect / Data Engineer in Durham, NC at Consultis

Date Posted: 5/17/2018

Job Snapshot

Job Description

Data Engineer to build Data Lake in AWS 

* Technical lead for team working with the domain teams related to data integration services 
* Implement a high-performance data platform for fast ELT processing and curation. 
* Refactor legacy data platforms to integrate with the high-performance data platform. 
* Implement data quality infrastructure and processes. 
* Define standards for and implement improved analytics tools. 
* Coordinate the implement of data infrastructure for emerging data classes. 
* Implement support structure for standard tools. 
* Identify analytics tool guidelines and standards for performing required data preparation, analysis, visualization, and reporting. 
* Provide business and architectural context to show how the analytics tools fit within the overall infrastructure and high-performance data architecture. 
* Organize, deliver, and ensure data integration support of a scientific computing capability. 

Expert in: 
* Amazon web services, specificly Ec2, Glue, EMR, Firehose, Athena, Lambda 
* Hadoop ecosystem, including Map-reduce, hive, impala, hbase, spark 
* Data modeling and design 
* Data architecture patterns and best practices 
* Coding: python, java, scripting 
* RDBMS (oracle, sql server, postgres) 

* nosql 
* Grid/qsub 
* RDBMS administration 
* data visualization tools (qlik, spotfire, tableau, etc) 
* data wrangling tools (alteryx, etc) 
* BI Tools 
* Modeling, Machine Learning & Bayesian statistics 
* node.js 

Ag or Life sciences domain experience highly desirable.


  1. Architect Jobs
  2. Systems Engineer Jobs