W-2 Jobs Portal

  • W-2 open positions need to be filled immediately. Consultants must be on our company payroll; Corp-to-Corp (C2C) is not allowed.
Candidates are encouraged to apply directly using this portal. We do not accept resumes from other companies or third-party recruiters.

Job Overview

  • Job ID:

    J37928

  • Specialized Area:

    Python

  • Job Title:

    Python Data Engineer

  • Location:

    South San Francisco, CA

  • Duration:

    9 Months

  • Domain Exposure:

    Government, Education, IT/Software

  • Work Authorization:

    US Citizen, Green Card, OPT-EAD, CPT, H-1B,
    H4-EAD, L2-EAD, GC-EAD

  • Client:

    To Be Discussed Later

  • Employment Type:

    W-2 (Consultant must be on our company payroll. C2C is not allowed)




Job Description

Immediate need for a senior data engineer who can also do some data science work. Experience in the biomedical space is desired.

Responsibilities

  • Assemble large, complex data sets in the format fit for each use case
  • Write generic Python/PySpark modules for processing data from various data sources (XML, Parquet, CSV, relational databases)
  • Architect, develop, and optimize ETL pipelines using Python, Spark, EMR, and Airflow
  • Develop and optimize big data pipelines for data scientists (requires a basic understanding of data science concepts and ML)
  • Research and recommend new innovative methods and systems to manage data for business improvement
  • Participate in internal governance to drive the data quality business cycle and roadmap

Required Skills

Python, Spark, ETL/data engineering, an S3-based data lake in AWS, and development and management of Airflow-based data flows.

  • Bachelor's or Master's degree in computer science or software engineering
  • 3+ years of programming experience (including functional programming); must be advanced in Python
  • Experience building and optimizing big data pipelines using Spark
  • Experience with AWS cloud services (S3, EC2, EMR, RDS, Redshift), as well as PySpark and Airflow
  • Experience with relational SQL and NoSQL databases, including Postgres
  • Solid understanding of how to design robust data workflows including optimization and user experience
  • Strong analytical and problem-solving skills
  • Excellent oral and written communication skills
  • Able to work in teams and collaborate with others to clarify requirements
  • Strong coordination and project management skills to handle complex projects
  • Experience developing and working with XML, JSON, and external web services

Preferred Qualifications

  • Clinical drug development domain knowledge
  • Experience working with clinical and biomedical data types (clinical patient data, omics, imaging, etc.)
  • Competencies in applied statistics to solve business needs
  • Knowledge of industry data standards used in drug development, particularly in clinical development

Equal Opportunity Employer

ARTIFICIAL INTELLIGENCE TECHNOLOGIES LLC is an equal opportunity employer inclusive of female, minority, disability and veterans (M/F/D/V). Hiring, promotion, transfer, compensation, benefits, discipline, termination and all other employment decisions are made without regard to race, color, religion, sex, sexual orientation, gender identity, age, disability, national origin, citizenship/immigration status, veteran status or any other protected status. ARTIFICIAL INTELLIGENCE TECHNOLOGIES LLC will not make any posting or employment decision that does not comply with applicable laws relating to labor and employment, equal opportunity, employment eligibility requirements or related matters. Nor will ARTIFICIAL INTELLIGENCE TECHNOLOGIES LLC require, in a posting or otherwise, U.S. citizenship or lawful permanent residency in the U.S. as a condition of employment, except as necessary to comply with law, regulation, executive order, or federal, state, or local government contract.