Job Description:
We are seeking a highly skilled and motivated Software Engineer – Data Scientist to join our team. In this role, you will focus on leveraging data science techniques and software engineering principles to ensure the accuracy, integrity, and reliability of our data assets. You will collaborate with cross-functional teams to design and implement data-driven solutions that support data management strategies, data quality frameworks, and data governance processes. Your expertise in data analysis, machine learning, and data quality assurance will be key to driving data-driven decision-making across the organization.
Responsibilities:
- Develop and implement data management and data governance strategies, ensuring the accuracy and reliability of data assets.
- Design, develop, and deploy machine learning models and algorithms to analyze and interpret complex datasets.
- Collaborate with data engineering and IT teams to ensure effective data integration, validation, and migration processes.
- Perform data analysis, profiling, and data cleaning to identify and address data quality issues, anomalies, and inconsistencies.
- Design and implement data quality assurance processes to improve data accuracy, completeness, and integrity.
- Develop and maintain data quality metrics and reports, tracking and monitoring data quality performance across the organization.
- Conduct regular data audits and assessments to identify areas for improvement and implement corrective actions.
- Stay up to date with industry best practices, emerging trends in data science, and software engineering technologies related to data management and quality assurance.
Requirements:
- Bachelor’s degree in Computer Science, Data Science, Information Systems, or a related field (Master’s degree preferred).
- Proven experience with data science tools and frameworks such as Python, R, TensorFlow, or PyTorch.
- Strong experience with SQL and data manipulation for analysis and reporting.
- 5+ years of experience with Cloud-based data management platforms such as AWS, Azure, or Snowflake.
- Experience in developing machine learning models and working with large-scale datasets.
- Strong problem-solving skills with the ability to analyze complex data issues and develop effective solutions.
- Familiarity with data governance frameworks, data quality management tools, and ETL processes.
- Excellent communication and collaboration skills, with the ability to work with both technical and non-technical stakeholders.
- A team player who thrives in a fast-paced and dynamic work environment.
- Strong attention to detail with a commitment to maintaining data accuracy, quality, and integrity.
Preferred Skills:
- Experience with data quality tools such as Informatica Data Quality (IDQ), Talend, or other similar platforms.
- Familiarity with cloud-based data warehouses like Snowflake, Redshift, or BigQuery.
- Knowledge of software engineering practices, version control systems (e.g., Git), and Agile development methodologies.