Title: AWS Data Engineer (PySpark/SageMaker) Location: New York, NY (Open to Remote) Duration: Long-term Contract... ETL/ELT pipelines using PySpark. Proven experience with AWS Redshift and Redshift Spectrum. Strong SQL skills for data...
AWS, big-data frameworks (e.g., Spark, Flink), and CI/CD best practices to ensure reliable, low-latency data feeds... Infrastructure: Design and maintain AWS-based data platforms (S3, EMR/Spark, Redshift/Athena/Glue). Optimize data storage...
of basic data science libraries in Python (NumPy, pandas, scikit-learn, PySpark). Understanding of the main deep-learning... understanding of traditional data science techniques and experience with data engineering pipelines for big data. Solid knowledge...
Preferred qualifications, capabilities, and skills: Experience in big data platforms such as Databricks, AWS EMR, SageMaker... science techniques and experience with data engineering pipelines for big data. Solid knowledge of RNN and LSTM models...