Job Description:
Develop and maintain the Huawei Big Data Stack. Extract structured and unstructured data from different sources and create structured data models. Daily work will involve Hive, Spark, Hadoop, and other related tools and technologies.
Develop recommendation features to improve business metrics such as CTR and CVR.
Develop reports from the data based on business requirements. Develop and maintain systems for self-analytics and other visualization requirements.
Maintain and manage the daily running of ETL jobs and fix data pipeline issues, focusing on “cost, compute, storage” optimization tasks on a day-to-day basis.
Deploy deep learning models, then maintain and improve their prediction accuracy through data quality, feature enhancement, and hyperparameter tuning.
Skills / Qualifications:
Master's or Bachelor's degree in Computer Science or a related field, with a coding and development background in database queries and related products.
Strong working experience with Hive, SQL, data modeling, and at least one programming language (Python, Java, Scala).
Strong working experience in big data technologies such as Apache Spark, Hive, Hadoop and Linux.
Strong capability to coalesce and present complex data sets from different sources to different data consumers.
Working Experience with one or more of the following: data processing automation, data quality, data warehousing, data governance, business intelligence, data visualization, data privacy.
Working experience in determining and implementing security models based on privacy requirements, confirming that safeguards are followed, addressing data quality issues, and evolving governance processes within allocated areas of ownership.
Solve data integration problems using optimal ETL patterns, frameworks, and query techniques, sourcing from structured and unstructured data sources.
Assist in owning existing processes running in production, optimizing complex code through advanced algorithmic concepts.
Maintain the data: ensure it is valid and free of duplicates before exposing it.
Experience in data visualization and report generation for A/B tests and model performance.
Next Step
Click “apply” or send resume to: Ryce [email protected]
EA Licence No. 91C2918 | Personnel Registration No. R23117258