← Back to jobs

Senior PySpark Developer - Vice President

Information Technology Senior Management Forum
FULL_TIME Remote · US Tampa, FL, Hillsborough, US USD 9487–14230 / month Posted: 2026-05-11 Until: 2026-07-10
Apply Now →
You will be redirected to the original job posting on BeBee.
Apply directly with the employer.
Job Description
Posted Date 5/08/2026 Description Job Description: We are seeking a highly skilled and experienced Senior PySpark Developer to join our dynamic technology team. This role requires an individual with deep expertise in Python, PySpark, Big Data technologies, and SQL, coupled with a strong ability to work independently and contribute significantly to complex data engineering initiatives. The ideal candidate will have a proven track record in designing, developing, and optimizing scalable data solutions, with experience in ETL processes and a keen interest in leveraging the latest technologies. Domain knowledge in Finance will be a significant advantage, enabling the candidate to contribute to critical financial crime compliance projects. Responsibilities: Design, develop, and implement robust, scalable, and high-performance data pipelines and applications using Python, PySpark, and Big Data technologies. Work autonomously to analyze requirements, propose technical solutions, and deliver high-quality code and data products, ensuring alignment with architectural standards and business objectives. Utilize expertise in various Big Data platforms (e.g., Hadoop, Hive, Kafka, Spark) to process, transform, and manage large datasets efficiently. Write complex SQL queries, stored procedures, and optimize database performance for large-scale data warehousing and analytics solutions. Develop and enhance ETL (Extract, Transform, Load) processes, ensuring data quality, integrity, and timely delivery. Experience with various ETL tools and methodologies is a plus. Proactively research, evaluate, and integrate new and emerging technologies, frameworks, and tools to improve development processes and solution capabilities. Ensure adherence to coding standards, conduct thorough code reviews, and implement best practices for software development, data governance, and security. Diagnose and resolve complex technical issues related to data pipelines, performance bottlenecks, and system integrations in a fast-paced environment. Collaborate effectively with cross-functional teams including architects, data scientists, business analysts, and QA engineers. Provide technical guidance and mentorship to junior team members. Identify opportunities to use AI tools to speed up development, code reviews, unit testing and deployment. Experience: 10+ years of experience in Applications Development, Systems Analysis, or equivalent senior engineering roles. Extensive hands-on experience delivering enterprise-scale, database-driven platforms in a regulated environment. Technical Skills Expert-level proficiency in Python programming, including object-oriented design, data structures, algorithms, and extensive experience with various Python libraries. Deep expertise in developing, optimizing, and deploying PySpark applications for large-scale data processing, ETL, and real-time analytics on distributed systems (e.g., Spark SQL, Spark Streaming, DataFrames). Strong understanding of Apache Spark architecture, Hadoop ecosystem, and experience with distributed computing concepts. Familiarity with big data storage formats (e.g., Parquet, ORC). Solid experience with both relational databases (e.g., Oracle) and NoSQL databases (e.g., MongoDB). Strong SQL writing and optimization skills. Experience in designing, developing, and consuming RESTful APIs using Python frameworks (e.g., Flask, FastAPI, Django REST Framework). Strong understanding and practical experience with CI/CD tools (e.g., Jenkins) and containerization technologies (Docker, Kubernetes). Expert-level proficiency with Git. Experience with unit testing (e.g., Pytest), integration testing, and performance testing frameworks for Python and PySpark applications. Exposure to at least one major cloud provider (AWS, Azure, or GCP), specifically with their compute, storage, and data services (e.g., S3, ADLS, EMR, Databricks, Azure Synapse) preferred. Exposure to or direct experience with Artificial Intelligence (AI) and Machine Learning (ML) concepts, frameworks (e.g., TensorFlow, PyTorch), or relevant projects is a significant advantage Soft Skills: Exceptional analytical and problem-solving abilities, with a strong capacity to understand complex business needs and translate them into effective technical solutions. Excellent leadership, team management, and mentoring capabilities. Superior verbal and written communication skills, with the ability to articulate complex technical concepts clearly to both technical and non-technical audiences. Strong collaboration and interpersonal skills, with a proven ability to wo