We are seeking a skilled Cloudera Engineer with 3+ years of experience in managing and optimizing Cloudera-based environments. The ideal candidate will be responsible for the installation, configuration, maintenance, and performance tuning of Cloudera's suite of products to ensure the stability and efficiency of our data platform.
Key Responsibilities:
- Cloudera Infrastructure Management: Set up, configure, and manage Cloudera Hadoop clusters, including HDFS, YARN, Hive, Impala, HBase, and Kafka components.
- Performance Optimization: Implement performance tuning strategies, capacity planning, and troubleshooting to maintain optimal performance across the Cloudera ecosystem.
- Security and Compliance: Implement security policies and access controls, and ensure compliance with standards such as GDPR and HIPAA within the Cloudera environment.
- Monitoring and Maintenance: Establish monitoring tools and practices for system health, and perform routine maintenance, upgrades, and patching of Cloudera clusters.
- Backup and Recovery: Develop and implement backup and disaster recovery strategies for Cloudera clusters to ensure data integrity and availability.
- Documentation and Training: Create and maintain documentation and best practices, and provide guidance to team members and end users on Cloudera usage.
Key Skills and Qualifications:
- Cloudera Expertise: Proven hands-on experience managing Cloudera distributions (CDH or Cloudera Data Platform).
- Hadoop Ecosystem: Strong understanding of Hadoop ecosystem components and their interactions.
- Linux Administration: Proficiency in Linux/Unix system administration and shell scripting.
- Performance Tuning: Experience in performance optimization, troubleshooting, and capacity planning for Cloudera clusters.
- Security and Compliance: Knowledge of security practices and access controls, and experience implementing security measures within Cloudera.
- Communication and Collaboration: Strong communication skills and the ability to collaborate within cross-functional teams.
Preferred Skills:
- Certifications: Cloudera Certified Associate (CCA) Administrator or other relevant certifications.
- Automation Tools: Familiarity with automation tools (e.g., Ansible, Chef, Puppet) for Cloudera infrastructure management.
- Streaming Technologies: Exposure to real-time data streaming technologies like Kafka or Spark Streaming.
- Cloud Experience: Understanding of Cloudera integration with cloud platforms (AWS, Azure, GCP).