The Job logo

What

Where

Data Scientist - Intern

ApplyJoin for More Updates

You must Sign In before continuing to the company website to apply.

Job Description:

Airbus Innovation Centre - India & South Asia: 

 

Airbus Innovation Centre - India & South Asia is responsible for industrializing disruptive technologies by tapping into the strong engineering competencies centre while also leveraging and co-creating with the vibrant external ecosystems such as big Tech Enterprises, mature startups/MSMEs, national labs & universities and strategic partners (customers, suppliers etc.)

The technology areas that the Innovation Centre focus on are - Artificial Intelligence, Industrial Automation, Unmanned Air Systems, Connectivity, Space Tech, Autonomy, Decarbonization Technologies etc. among others.

Airbus Innovation Centre in India is 1 among 3 Innovation Centres globally for Airbus with a strong focus on A.I. and Digital Engineering. We build products from the ground up with the help of stakeholders from within Engineering and Digital competence centres (in addition to the external stakeholders mentioned above) to deliver operational excellence and contribute to the Innovation & Technology roadmap of the organization.

 

Title: 

High-Dimensional Constrained Design of Experiments for ML applications

 

Introduction:

Surrogate models are used in the area of multidisciplinary analysis and optimization. These surrogate models have the advantage over simulations that they can approximate the effects of parameter variations in real time. This enables savings in terms of time and costs when developing a new aircraft or aircraft variants. In addition, more variations of the parameters can be performed. The optimal point of design can be searched for and the necessary knowledge about the interrelationships of the parameters at the point of design is provided.

These surrogate models are (in our use cases) Machine Learning (ML) models. Accordingly, a data set must be available for the training of these surrogate models. The Design of Experiments (DoE) methodology is used to create an optimal data set for this purpose. The goal is to map ‘m’ simulation inputs to ‘n’ simulation outputs. The larger goal is to create an adaptive DoE which, based on the needs, either finds the optimum dataset or does active learning to increase the performance of the subsequently built surrogate model. Since the simulations are performed sequentially, it is possible to use the already calculated data points to determine the position in the design space where data points have the largest amount of information.

 

However, before investigating further we want to focus again more on the DoE. In order to decrease the design space, i.e. the space that optimization algorithms have to search through, a constraint DoE was developed. The aim of this work is to reach practical application of this constraint DoE by adding further input dimensions. A full description of an existing DoE to translate into a constraint DoE is available. It works today with a cubical “base” DoE whose domain is transformed, in a post-processing step, to comply with the underlying constraints. The problem with this methodology though is that the final sample distribution is not homogeneous. This again leads to potential bias and unnecessary large sample sizes for ML applications.


 

Key Responsibilities:

Today we have two constraint DoE's:

1. zerofuel_mass, zerofuel_cg, fuel_weight

2. altitude, speed, vertical_loadfactor

 

Learn about the existing DoE libraries (OpenTurns, JohnDoE) by producing unit tests, docstrings, and documentation

 

For the 1st DoE you switch from a temporary Fuel vector implementation to the official implementation

 

Enhance the 1st DoE by increasing the complexity through an additional dimension: fuel density

Enhance the 1st DoE by increasing the complexity through splitting dimension fuel_weight into: re_fuel_weight and de_fuel_weight

 

Merge the 1st and 2nd DoE into one DoE

add all missing independent further dimensions to reach practical applicability.

 

Qualifications:

Strong Python skills.

Statistics background

Experience with Data Wrangling and Preprocessing.

Experience in Design of Experiments for Data Generation.

Proficiency in version control systems (e.g., Git) and software development best practices.

Machine Learning & Deep Learning Model Development Cycle.

 

EDUCATION:
M.Sc. / M.Eng. in Computer Science, Date Engineering, Mathematics, Aerospace

Set alert for similar jobsData Scientist - Intern role in Bengaluru, India
Airbus Logo

Company

Airbus

Job Posted

18 days ago

Job Type

Full-time

WorkMode

On-site

Experience Level

0-2 Years

Category

Data Science

Locations

Bengaluru, Karnataka, India

Qualification

Bachelor

Applicants

106 applicants

Related Jobs

NetApp Logo

Intern - Data and Applied Scientist

NetApp

Bengaluru, Karnataka, India

Posted: 7 months ago

NetApp is seeking a Data & Applied Scientist Intern to join the Data Services organization. The intern will create synthetic data, improve NLP models, co-author papers, and work on computer vision and NLP projects.

Ericsson Logo

Intern Data Scientist

Ericsson

Chennai, Tamil Nadu, India

Posted: 5 months ago

Description Join our Team About this opportunity: We are seeking a versatile Data Scientist to join our dynamic team at Ericsson. You will play a pivotal role in harnessing machine learning solutions to solve complex business problems. Predicated on scientific methods, process-driven systems, you will be the driving force behind Ericsson's applied analytics. You will be expected to understand classical and advanced machine learning concepts and apply this knowledge practically to fulfil customer requirements. What you will do: Participate in mapping requirements to implementation - Analysing, coordinating, prioritizing, and optimizing requirements. Ensuring implementation even with constraints. Work with data and develop predictive models, recommendation engines, anomaly detection systems, statistical models, deep learning models, and other machine learning systems Good understanding of machine learning concepts and programming languages like Python, Pyspark, SQL etc. based on latest market trends Ability to work within constraints and timelines and follow the delivery standards and processes defined within Ericsson Good understanding and implementation knowhow of on-premise and well as on-cloud machine learning solutions The skills you bring: - Business Understanding. - Artificial Intelligence Systems. - Software Engineering. - Data Management. - Ericsson Business Intelligence and Analytics Competence. - Open-Source Programming Languages. - Data Preprocessing. - Statistics. - Cloud Development. - Machine Learning Algorithms.

Myntra Logo

Data Scientist

Myntra

Bengaluru, Karnataka, India

Posted: 4 months ago

About Team Myntra Data Science team delivers a large number of data science solutions for the company which are deployed at various customer touch points every quarter. The models create significant revenue and customer experience impact. The models involve real-time, near-real-time and offline solutions with varying latency requirements. The models are built using massive datasets. You will have the opportunity to be part of a rapidly growing organization and gain exposure to all the parts of a comprehensive ecommerce platform. You’ll also get to learn the intricacies of building models that serve millions of requests per second at sub second latency.  The team takes pride in deploying solutions that not only leverage state of the art machine learning models like graph neural networks, diffusion models, transformers, representation learning, optimization methods and bayesian modeling but also contribute to research literature with multiple peer-reviewed research papers. Roles and Responsibilities Responsible for developing data science and machine learning models for Myntra Storefront, Supply Chain and other areas Conversant with machine learning life cycles, model deployments etc. Theoretical understanding and practise of machine learning and expertise in one or more of the topics, such as, NLP, Computer Vision, and Optimisation. Collaborating with Product and Business to formulate the problem into a Machine Learning model. Connecting with Platforms and Engineering teams to make sure the predictive models built are deployed and integrated into the systems. Working with the Data Platforms teams for understanding and collecting the data. Qualifications & Experience Knowledge on data structures, algorithms and efficient processing of large datasets Well-versed with Python or equivalent programming language. People with MTech, MS by Research or PhD in Computer Science/Electrical Engineering, or post graduate degree in Statistics, Operations Research, or Mathematics are preferred.  Having authored publications is plus Good to have skills in general Machine Learning and Deep Learning such as Natural Language Processing (NLP): Skills in NLP to develop models that understand and generate human language. Computer Vision: Knowledge of computer vision techniques to interpret images and videos, including object identification and classification. DevOps: Familiarity with DevOps principles and practices for deploying and managing machine learning models in production environments. LLM and GenAI Models: Understanding of Large Language Models (LLM) and Other Generative AI (GenAI) models, including prompt engineering, model evaluation, optimization, and deployment. Domain Knowledge: Expertise in a specific industry or field, enabling better understanding of data and business needs. Ethical Data Science: Awareness of ethical considerations in data science, including privacy, bias, and fairness. Collaboration: Strong communication and teamwork skills for effective collaboration with cross-functional teams, including engineers, product managers, and designers.