Report this Job

or email this job to apply later

Big Data Hadoop Engineer

Location: Charlotte (300 Brevard), Addison, TX, Chandler, AZ

12+ months contract to hire

Requirements:

Job Expectations:

Design and implement automated sparkbased framework to facilitate data ingestion, transformation and consumption.
Implement security protocols such as Kerberos Authentication, Encryption of data at rest, data authorization mechanism such as rolebased access control using Apache ranger.
Design and develop automated testing framework to perform data validation.
Enhance existing sparkbased frameworks to overcome tool limitations, and/or to add more features based on consumer expectations.
Design and build high performing and scalable data pipeline platform using Hadoop, Apache Spark, MongoDB, Kafka and object storage architecture.
Work with Infrastructure Engineers and System Administrators as appropriate in designing the bigdata infrastructure
Collaborate with application partners, Architects, Data Analysts and Modelers to build scalable and performant data solutions.
Effectively work in a hybrid environment where legacy ETL and Data Warehouse applications and new bigdata applications coexist
Work with Infrastructure Engineers and System Administrators as appropriate in designing the bigdata infrastructure.
Support ongoing data management efforts for Development, QA and Production environments
Provide tool support, help consumers troubleshooting pipeline issues.
Utilizes a thorough understanding of available technology, tools, and existing designs.
Leverage knowledge of industry trends to build best in class technology to provide competitive advantage.

Required Qualification:

5 plus years of experience of software engineering experience
5 plus years of experience delivering complex enterprise wide information technology solutions
5 plus years of experience delivering ETL, data warehouse and data analytics capabilities on bigdata architecture such as Hadoop
5 plus years of Apache Spark design and development experience using Scala, Java, Python or Data Frames with Resilient Distributed Datasets (RDDs), Parquet or ORC file formats
6 plus years of ETL (Extract, Transform, Load) Programming experience
2 plus years of Kafka or equivalent experience
2 plus years of NoSQL DB like Couchbase/MongoDB experience.
5 plus experience working with complex SQLs and performance tuning

Desired Qualification:

3 plus years of Agile experience
2 plus years of reporting experience, analytics experience or a combination of both
2 plus years of operational risk or credit risk or compliance domain experience
2 plus years of experience integrating with RESTful API
2 plus years of experience with CICD tools.

Manager Notes:

Team Duties and business impact: The Risk Data Services is a horizontal function within Risk Technology organization and is responsible for delivering data consistently across Risk. The Risk Data Services team is seeking a Lead Software Engineer. Driving strategy for the team for entire platform and help with migration to cloud.
Big Data Hub, the tools are used purely for ETL functions, moving and conforming data, not for building web applications. The use of Python and Java is for Data Engineering, we are not seeking Web Developers, back and front end etc.

Connvertex Technologies Inc.

Apply Online

or email this job to apply later

	Search millions of jobs
Jobvertise