Open position at MSD IT

Big Data Engineer

Work schedule
Full-time
Address
Svornosti 3321/2, 150 00 Smíchov, Czechia

Responsibilities:

  • Implementation and ongoing administration of Hadoop infrastructure
  • Cluster maintenance including creation and removal of nodes
  • Performance tuning of Hadoop clusters
  • Monitoring of Hadoop cluster connectivity and security
  • Support and maintenance of Hadoop services: HDFS, Hive, HBase and Kafka
  • Software patches and upgrades
  • Automation of manual tasks using Ansible
  • Collaborating with application teams to install operating system and Hadoop updates, patches and version upgrades
  • Deployment of Hadoop clusters, including adding and removing nodes, tracking jobs, monitoring critical parts of the cluster and configuring high availability
  • Researching and recommending technical and operational improvements to increase reliability and efficiency

Requirements:

  • Strong experience with UNIX/Linux-based systems and scripting (Bash or Python)
  • Knowledge of the Hadoop ecosystem: YARN, MapReduce, HDFS, HBase, ZooKeeper, Kafka, Spark, Hive
  • Experience with configuration management tools such as Ansible, Puppet, Chef or Salt
  • Knowledge of directory services such as LDAP and ADS
  • Knowledge of monitoring tools such as Nagios or Icinga2
  • Distributed systems troubleshooting skills
  • Ability to communicate in English

Bonus:

  • Experience configuring Hadoop security using Kerberos or PAM
  • Experience with cloud services such as AWS
  • Experience troubleshooting Java applications
  • Experience with agile development

Get in touch with Nikola Kalčić at nikola.kalcic@merck.com.