Thanathon Boonyakijjinda

Name: Thanathorn Boonyakijjinda

Nickname: Chopper

Age: 25

Job Position: Data Engineer

Company: Bluebik Vulcan

bachelor's degrees: Thammasat University

GPA: 3.58

About Me

May I introduce myself? My name is Thanathorn Boonyakijjinda, but you can call me Chopper. I was born on 11 April 2000, making me 25 years old. I'm from Thailand and currently living in Bangkok. For my educational background, I graduated from Thammasat University with a bachelor's degree in Electrical Power Engineering with a GPA of 3.58. I have a passion for continuous learning and am always eager to enhance my skills. I am easygoing and cheerful, which helps me build strong relationships and collaborate effectively with others.

Skills

Professional Skills.

Key Skills

  • Python (Google API, bs4 selenium, tkinter, pyinstaller, matplotlib, seaborn, Flask)
  • R program (dplyr, dummies, tidyverse, syuzhet)
  • Database (PostgreSQL (Trigger, Function), BigQuery, Spanner, SQLite, MongoDB)
  • Excel (VLookup, Pivot-table, IF, Query, Google Sheets)
  • JavaScripts (APP SCRIPTS)
  • Docker, Kubernetes
  • GitHub, Vercel
  • ChatGPT
  • Google cloud platform (GKE, GCS, ETC.)

Key Skills Visualization

  • Looker Studio
  • Power BI

Key Skills Tools Of Data

  • Data Lake (AWS S3, GCS, MinIO)
  • Data warehouse (Databricks, Snowflake, Bigquery, Spanner)
  • Data pipeline (Fivetran)
  • Data transform (DBT, PySpark)
  • Data Orchestration (Airflow, Dagster)

Experience

My experience worked on.

A Lot tech (Aug 2024-Jan 2025)

Data Analyst

My responsibilities encompass extracting data from PDF files through web scraping, designing and maintaining efficient ETL pipelines, automating daily data workflows, and delivering impactful reports and visualizations. These efforts empower the marketing team with strategic insights to drive informed decision-making.

Key Skill: Bs4, request, GCP API, JupyterLab, Colab, SQL, PowerBI

Bluebik Vulcan (Jan 2025 - current)

Data Engineer

  • Cost Optimization

    I have been commissioned to optimize costs in Google Cloud Storage (GCS), to reduce storage costs by analyzing data usage patterns and implementing storage policies that align with data access frequency.
    • Storage Usage Patterns: Reviewed object access logs to identify inactive (cold) data, monitored storage growth and usage trends using Cloud Monitoring and Logging.
    • Data by Access Frequency: Categorized data into Hot (frequently accessed), Cold (rarely accessed), and Archived (long-term retention).
    • Lifecycle Policies: Configured GCS lifecycle rules to automatically move or delete files based on access time.
    • Redundant or Unused Files: Wrote Python scripts to detect and delete duplicate or obsolete files, coordinated with data owners to validate file necessity.
  • Vector Search

    Built a scalable and intelligent search system that combines full-text search and vector search (Hybrid Search) using Google Cloud Spanner as the backend, with data pipeline orchestration via Dagster.
    • Built a Dagster Data Pipeline: Developed a pipeline to extract data from CMS-stored tables in BigQuery, transformed and loaded the data into Google Cloud Spanner.
    • Spanner Schema for Hybrid Search: Defined schema to store both text fields and vector embeddings (index, partitions), ensured data consistency and optimized schema for hybrid querying.
    • Full-text and Vector Search: Enabled full-text indexing within Spanner, integrated vector similarity search using approximate nearest neighbor (ANN) techniques.
    • Hybrid Search Logic in App Layer: Combined full-text and vector search results into a unified ranking system, built an API layer to expose the search functionality to client applications.

Key Skill: GCP, GCS, GKE, Dagster, Datahub, BigQuery, Spanner, VectorSearch, Full-TextSearch

Portfolio

My projects worked on.

ETL with Airflows Docker

Airflows / 10 Sep 2024

ELT with Dagster

Dagster / 11 Sep 2024

Aws and Databricks

Aws Databricks / 17 Sep. 2024

Application Ordersales By Flask , Vercel , MongoDB , Pyspark and PostgreSQL

Semi-structure and structure / 17 Sep. 2024

Web scraping By python

Python program / 25 Apr. 2023

CERTIFICATE

My Upgrade Skill with Data and programming skill

DATACAMP

DATA ENGINEER

This is certificate for datacamp:It covers data management theory , data management in SQL and Python and exploratory analysis theory

DataTH

Road to Analytics Engineer

Teach the entire process of Data Engineering from start to finish, overing topics such as GitHub, Data Lake, Data Warehouse, Data Lakehouse, Data Pipelines, Data Architecture, Data Orchestration, etc. The tools used include AWS, Databricks, Snowflake, Fivetran, Airbyte, dbt, Terraform, and Dagster.

DATACAMP

SQL ASSOCIATE

This is certificate for datacamp: It covers SQL in Data management , Exploratory Analysis includes math statistics

TRUE ACADEMY

DigiProof Assessment : Basic Data Analyst

This certificate for digital true academy: demonstrating their skills and knowledge in key areas for the of role junior data analyst. such as Data Analytics Foundation , Data Preparation and Cleaning , Data Engineering and Management , Exploratory Data Analysis , Data Visualization , Modeling , Math and Statistics and Data Science Comprehensive

Google Coursera

Google Advanced Data Analytics

practice-based assessments and is designed to prepare them for advanced roles in data analytics and entry-level roles in data science. They are competent in exploring large datasets, applying data analysis techinques, and building models to extract insights. They are also competent in machine learning , predictive modeling , and statistics.

Google Coursera

Google Data Analytics

This course is Google Data Analytics from Google on Coursera, which includes hands-on, practice-based assessments and is designed to prepare learners for introductory-level roles in data analytics. Participants will become proficient in tools and platforms such as spreadsheets, SQL, Tableau, and R.

Microsoft Fundamental

Microsoft Azure AI Fundamentals

This is a certificate for Microsoft Azure: Azure AI Fundamentals (AI-900). It covers fundamental concepts in data science, data engineering, and data analytics.

Microsoft Fundamental

Microsoft Azure Data Fundamentals

This is a certificate for Microsoft Azure: Azure Data Fundamentals (DP-900). It covers fundamental concepts in data science, data engineering, and data analytics. And Sever Data Services as Azure sql database.

Dagster DBT

Dagster

This course is offered by Dagster University and teaches the usage of Dagster tools, which work with the Python language. It also includes the use of DBT as an add-on to enhance the efficiency of ELT processes.

FutureSkill

Business_analysis R program

This course focuses on R programming for machine learning, with a focus on using decision trees to predict user engagement with mobile apps And CRISP-DM process . which consists of business understanding, data understanding, data preparation, modeling, evaluation, and deployment.

FutureSkill

Machine Learning Business

This course covers advanced machine learning topics in Python, with a focus on logistic regression models for predicting loan approval. how to use WoE (Weight of Evidence) and IV (Information Value) to increase model performance, as well as how to apply machine learning techniques such as VIF (Variance Inflation Factor) and AUC (Area Under the Curve) to assess model accuracy and performance.

Ultimate Python

Web Scraping By Ultimate Python

This course involves practicing how to extract data from web platforms such as Condo Trading using Python programming and save it into CSV and XLSX file formats.

Contact Me

Feel free to reach out to me via the information below.