Description

Roles & Responsibilities:

Python/Spark based Data Scientists and Data Engineer.
Previous experience 7-10+ as a big data engineer.
In-depth knowledge of Hadoop (Cloudera), Spark, and similar frameworks.
Good knowledge of Big Data querying tools, such as Pig, Hive, and Impala
Knowledge of scripting languages including Java, C++, Linux, Ruby, PHP, Python, and R.
Own most deliverables for the Big Data team from a delivery perspective.
Ability to solve complex networking, data, and software issues.
Able to Effectively Plan & Organize Their Work
Strong Interpersonal Communication
Assist others in the completion of their tasks to support the group goals.
Build and maintain cooperative work relationships with others


Experience Required:

Extensive experience in Big Data space (Hadoop Stack like M/R, HDFS, Pig, Hive, HBase, Flume, Sqoop, NoSQL stores like Cassandra, HBase etc.) across Fractal and contributes to open-source Big Data technologies.
Write and tune complex Java, MapReduce, and Hive jobs.
Experience leading a Backend/Distributed Data Systems team while remaining hands-on is very important.
Manage the business intelligence team and vendor partners, ensuring to prioritize projects according to customer and internal needs, and develops top-quality dashboards using industry best practices.
Manage team of data engineers (both full-time associates and/or third-party resources)
Analyzes and confirms the integrity of source data to be evaluated.
Leads in deployment and auditing models and attributes for accuracy.
Experience with stream-processing systems: Spark-Streaming, Strom etc.
Experience with object-oriented/object function scripting languages: Python, Scala etc.
Experience in designing and building dimensional data models to improve accessibility, efficiency, and quality of data.
Should be proficient in writing Advanced SQLs, Expertise in performance tuning of SQLs. Experience with data science and machine learning tools and technologies is a plus.
Experience with relational SQL and NoSQL databases, including Postgres and Cassandra.
Experience with Azure cloud services is a plus.
Financial Services Knowledge is a plus.

Education

Bachelor's degree in Computer Science