• Hi!
    I'm Abdullah

    I am currently working as a data engineer (DE) at Xref/TSS.

About Me

Who Am I?

Hi I'm Abdullah. I am a Data Engineer at Xref and Telenor Shared Services.

As a Data Engineer, I have a passion for turning data into actionable insights and building scalable data solutions. With hands-on experience in cloud environments like AWS and a strong foundation in data modeling, warehousing, and data streaming, I excel at creating robust data pipelines and optimizing complex queries for large-scale data processing.

I have worked on diverse projects—from developing ML models to enhance financial operations at Xref, to contributing to Ivy's open-source graph compiler, enabling seamless code conversions across ML frameworks. My expertise spans across Python, SQL, Kafka, PostgreSQL, TensorFlow, and PyTorch, allowing me to deliver high-impact solutions that drive business success.

I thrive in collaborative, fast-paced environments and am always eager to explore innovative ways to solve complex data challenges. I'm also a passionate open-source contributor, constantly seeking opportunities to learn, share, and grow within the data science community.

What I do?

Here are some of my overall skills

Data Engineering

I've experience in Data Pipeline Development, Data Architecture and Modeling, Data Integration and ETL, Big Data Technologies, Cloud Computing, Automation, and Data Governance and Quality

ML/DL

I have experience in research, designing and developing sophisticated ML solutions for real-world problems using traditional and cutting-edge ML solutions.

Skills

Knowing that technology is evolving fast, made me passionate about learning new concepts and skills. Ability to implement novel ideas and learn new Technologies.

My Specialty

My Technical Skills

My skills are divided into three sections: Professional, Intermediate, and Familiar. The skills in the Professional section are the tools that I'm working with regularly.

Professional

Programming Languages    Python, C++, C, Java, Scala
   Libraries & Frameworks AWS, DBT, Kafka, Elasticsearch,
Data Modeling, Data Warehousing
PyTorch, Keras, MLflow
Hugging Face Transformers, spaCy, NLTK, Cython
NumPy, scikit-learn, pandas, Mathplotlib, jupyter notebooks
   Databases DBMS:  MySQL, SQL, Redshift, MYSQL Server, MongoDB, DynamoDB, PostgreSQL, DuckDB
Cloud    Docker    AWS
Version control    Git,    GitHub,    GitLab
Operating system   Linux,    Mac,    Windows,
Writing Tools Google Docs, Microsoft Office , LATEX
Workflow Agile Development & Scrum
others    Slack,   Trello

Intermediate

Programming Languages    Java, C, C++
   HTML, CSS, JavaScript (Bootstrap), Flask, Node.js, JavaFX, SwiftUI
MASM Assembly, MATLAB, R
   Python Libraries JAX, pytest, py2neo, selenium, PyMongo, PyMsql, ...
Programming platforms    Web
   Databases DBMS:  SQLserver(familiar)
NoSQL: Neo4j, MongoDB
Education

Education

B.S. Computer Science May 2018 – July 2022

PUCIT, Lahore, Punjab, Pakistan.
GPA: 3.40 (Out of 4.0), via 12 credit
FYP title: "Crop Price Prediction"
Courses:

  • Data Science: A+
  • Machine Learning: A+
  • System Programming: A-
  • Cloud Computing: A

Experience

Work Experience

Data Engineer Feb 2025 – Present

Telenor Shared Services, Islamabad, PK (Full-time/Hybrid)

  • Designed and implemented a comprehensive data architecture from the ground up, establishing scalable data pipelines and storage solutions to support organizational data needs.
  • Developed and maintained robust ETL processes, facilitating the seamless extraction, transformation, and loading of data from diverse sources, thereby enhancing data accessibility and integrity.
  • LEngineered complex data transformations to process and analyze large datasets, enabling actionable insights that drive business decisions across multiple business units in over 15 countries.

Data Engineer Nov 2023 – Present

Xref, Sydney, AU (Contract/Remote)

  • Optimized complex queries using Common Table Expressions (CTEs) for efficient data retrieval and transformation.
  • Integrated data from multiple sources, resolving scattered dimensions across tables and objects.
  • Leveraged AWS cloud-based tools for data-related operations and developed ML models to optimize financial tasks.

Machine Learning Research Engineer July 2022 – Nov 2023

Unfiy AI, London, UK (Full-time/Remote)

  • Designed Ivy's graph compiler and transpiler for automatic code conversions between frameworks.
  • Collaborated with open-source partners to integrate Ivy into popular repositories, adding state-of-the-art models to the Ivy model hub.

My Work

Projects

TSS/Current Projects

  • Designed and implemented a comprehensive data architecture from the ground up, establishing scalable data pipelines and storage solutions to support organizational data needs.
  • Developed and maintained robust ETL processes, facilitating the seamless extraction, transformation, and loading of data from diverse sources, thereby enhancing data accessibility and integrity.
  • Engineered complex data transformations to process and analyze large datasets, enabling actionable insights that drive business decisions across multiple business units in over 15 countries.
  • Technologies: AWS, DBT, PostgreSQL, Data Modeling, Data Warehousing, Kafka, Scala, Spark, Hadoop, Tableau, MongoDB, Elasticsearch, DynamoDB, Scikit-Learn.

Xref/Current Projects

  • Optimized complex queries using Common Table Expressions (CTEs) for efficient data retrieval and transformation.
  • Integrated data from multiple sources, resolving shattered dimensions across tables and objects.
  • Leveraged AWS cloud-based tools for data-related operations and developed ML models to optimize financial tasks.

Unify Projects

  • Designed Ivy's graph compiler and transpiler for automatic code conversions between frameworks.
  • Collaborated with open-source partners to integrate Ivy into popular repositories, adding state-of-the-art models to the Ivy model hub.

Open Source Projects

  • JAX: Contributed to improve dlpack module in JAX (Google Deep Learning framework).
  • LlamaIndex: Built context-augmented generative AI applications with LLMs using Python, LlamaIndex, OpenAI, and Streamlit.
  • Ivy: Unified multiple ML frameworks and made several contributions to the repository.
  • LibrePhotos: Contributed to an open-source, self-hosted photo management service using Python, Django, PyTorch, and React.

University Project

  • Crop Price Prediction: Developed ML models to predict crop prices using classification, random forest, and linear regression.
Get in Touch

Contact

Lahore, Punjab, Pakistan