Data Engineer
Responsibilities:
Gather requirements for model parameters.
Build scripts that automate feature extraction for the ML model.
Collaborate with product owners and technical leaders to integrate ML models into products and services.
Process streaming and raw data according to user needs.
Design and maintain scalable, reliable data pipelines to move data across systems.
Develop and optimize data warehousing solutions, ensuring efficient data delivery and storage for analysis.
Design distributed systems to apply machine learning and data science techniques.

Requirements:
Bachelor's degree or higher in Computer Science or a related field
1+ years of hands-on experience with technologies such as Apache Spark, SQL, NoSQL, and PostgreSQL
1+ years of experience with cloud environments such as GCP, AWS, or Azure is preferred
Experience with AI/ML platforms such as BigQuery, TensorFlow, or similar products is a big plus
Project experience using Computer Vision is highly recommended
Experience in building and optimizing data warehousing solutions
Proficiency in at least one of the following programming languages: Java, Python
Proficiency in Unix and Linux operating systems
Experience writing Apache Spark jobs is preferred
Proficiency in designing and maintaining end-to-end data pipelines
Strong knowledge of data warehousing concepts, architectures, and processes
Proficiency in PostgreSQL, MySQL, and NoSQL databases
Passion for coding, innovation, and solving challenging problems