Please strictly adhere to the following resume naming convention:
ALL CAPS, NO SPACES B/T UNDERSCORES
PTN_US_GBAMSREQID_CandidateBeelineID
i.e. PTN_US_9999999_SKIPJOHNSON0413
: MAX CONFIRMED- (Max)
MSP Owner: Shilpa Bajpai
Location: West Chester, PA - 100% ONSITE
Duration: 6 months
skill id: 10834326
Role: Data Engineer
Skills: Digital : Snowflake~Digital : PySpark
Experience Required: 6-8 Years
Role Descriptions: Pipeline Development:
Design and implement robust ETL/ELT pipelines using PySpark and SQL to ingest data from diverse sources including APIs| flat files| and relational databases
Data Modeling:
Develop and manage complex data models (e.g.| Star/Snowflake schemas) and maintain Fact and Dimension tables within Snowflake
Performance Optimization:
Monitor and tune Snowflake queries and Spark jobs to optimize performance| reduce latency| and manage computational costs
Data Quality & Integrity:
Implement automated data validation frameworks and testing procedures to ensure a ""single source of truth"" and high data reliability
Operations:
Troubleshoot production pipeline issues| manage version control via Git, Project Code :
ALL CAPS, NO SPACES B/T UNDERSCORES
PTN_US_GBAMSREQID_CandidateBeelineID
i.e. PTN_US_9999999_SKIPJOHNSON0413
: MAX CONFIRMED- (Max)
MSP Owner: Shilpa Bajpai
Location: West Chester, PA - 100% ONSITE
Duration: 6 months
skill id: 10834326
Role: Data Engineer
Skills: Digital : Snowflake~Digital : PySpark
Experience Required: 6-8 Years
Role Descriptions: Pipeline Development:
Design and implement robust ETL/ELT pipelines using PySpark and SQL to ingest data from diverse sources including APIs| flat files| and relational databases
Data Modeling:
Develop and manage complex data models (e.g.| Star/Snowflake schemas) and maintain Fact and Dimension tables within Snowflake
Performance Optimization:
Monitor and tune Snowflake queries and Spark jobs to optimize performance| reduce latency| and manage computational costs
Data Quality & Integrity:
Implement automated data validation frameworks and testing procedures to ensure a ""single source of truth"" and high data reliability
Operations:
Troubleshoot production pipeline issues| manage version control via Git, Project Code :