-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathTed_Zhao_ML.yaml
197 lines (195 loc) · 8.38 KB
/
Ted_Zhao_ML.yaml
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
cv:
name: Ted Zhao
location: 10/4 Sheppards Lane, Norwood SA 5067, Australia
email: [email protected]
phone: "+61-416-551-622"
social_networks:
- network: LinkedIn
username: ted-zhao
- network: GitHub
username: tade0726
sections:
summary:
- bullet: Machine Learning Engineer with 5+ years of experience delivering end-to-end AI solutions across fintech, travel, and retail sectors.
- bullet: Expertise in RAG systems, computer vision, and reinforcement learning, improving business metrics by 2-10% across projects.
- bullet: Strong MLOps background with Databricks, MLflow, and ZenML, focusing on production-grade model deployment and monitoring.
- bullet: Track record of innovative solutions from credit scoring to LLM-powered chatbots, consistently driving business value through AI.
experience:
- company: Rainmakr.ai
position: Data/AI Engineer
location: Adelaide, SA (Remote)
start_date: 2023-05
end_date: 2024-07
highlights:
- Developed LLM prompts with RAG to categorise company data into GICS-like themes, enhancing industry analysis.
- Built knowledge graphs from LLMs to extract entity relationships (executives, competitors, supply chains) for financial insights.
- Simplified 30+ table data models with DBT, reducing schema size by 50% and redundant data by 15%.
- Optimised queries through join refinement and re-indexing, cutting runtime by 50%.
- Integrated DBT with Dagster Cloud for mixed orchestration of notebook and DBT jobs.
- company: Skyscanner
position: ML Engineer/Data Scientist
location: Shenzhen, China
start_date: 2019-11
end_date: 2021-07
highlights:
- Led end-to-end ML pipeline in Databricks/MLflow, engineering user behaviour features and iteratively improving CTR by 5-10%.
- Deployed ensemble of LightGBM and Deep Learning models with A/B testing framework for continuous model refinement.
- Developed ResNet-based classifier for 20 hotel categories with hierarchical labelling system, achieving 99% Top-5 accuracy.
- Enhanced image selection using MobileNet with multi-dimensional aesthetic scoring (composition, lighting, detail), lifting CTR by 3%.
- Built T+1 batch processing pipeline using AWS Lambda/PySpark for feature extraction, storing in Parquet format.
- Engineered real-time feature streaming from Elasticsearch with Redis caching, improving inference CTR by 2%.
- company: DJI
position: Data Scientist
location: Shenzhen, China
start_date: 2018-04
end_date: 2018-08
highlights:
- Built Scrapy-based web crawler to gather city demographics and competitor store distribution data for market analysis.
- Developed quantitative ranking algorithm incorporating economic indicators and business factors for city prioritisation.
- Analysed post-launch user demographics (location, income, age) to create detailed customer segments.
- Monitored sales patterns and provided data-driven insights for campaign strategy optimisation.
- Developed Django/SQLAlchemy-based API for Jupyter extension, enabling efficient SQL querying and data model management.
- company: Smart Decision Technology Ltd
position: Data Scientist
location: Shenzhen, China
start_date: 2015-08
end_date: 2018-03
highlights:
- Implemented credit scorecard system with feature selection, WOE calculation, and logistic model conversion in Python.
- Built SimPy-based logistics simulation with Flask/SQLAlchemy backend, securing 2-year contract with S.F. Express.
- Applied K-Means clustering for HKJC member segmentation, delivering lifecycle-based targeting strategies.
- Developed Chinese text sentiment analysis using Word2Vec and K-means for thematic clustering.
- Built supervised learning model with custom Chinese tokenization for app review classification.
projects:
- name: ATO Chatbot
date: 2023
highlights:
- "**Live Demo**: [ato-chat.streamlit.app](https://ato-chat.streamlit.app/) | **Source**: [github.com/tade0726/ato_chatbot](https://github.com/tade0726/ato_chatbot)"
- Built production-grade RAG system with LlamaIndex and Qdrant vector store, enabling context-aware tax information retrieval.
- Implemented MLOps pipeline with ZenML for data ingestion, OpenAI fine-tuning, and automated model evaluation.
- Developed Streamlit-based UI with real-time query rephrasing and source citations for enhanced user trust.
- name: Quantitative Trading with Reinforcement Learning
location: University of Adelaide
date: 2022-09
end_date: 2023-02
highlights:
- Researched and developed crypto trading strategy using Reinforcement Learning at University of Adelaide.
- Designed multi-asset portfolio optimisation with Sharpe Ratio reward system for buy/sell quantity decisions.
- Built dual Deep Learning modules for price signal prediction and market sentiment analysis.
- Implemented comprehensive backtesting framework evaluating Sharpe Ratio and Maximum Drawdown (MDD).
- Developed profit analysis system incorporating trading fees and compound returns visualisation.
skills:
- label: ML/AI
details: Reinforcement Learning, Deep Learning, Clustering, LightGBM, XGBoost, SVM, Word2Vec, A/B Testing
- label: MLOps/Cloud
details: Azure ML, AWS, SageMaker, MLflow
- label: Data Engineering
details: DBT, Databricks, Snowflake, Dagster
- label: Programming
details: Python, VBA, Scala, Julia
- label: Web Development
details: Django, Flask, Asyncio, Scrapy
- label: ETL/Data Tools
details: SQL, SQLAlchemy, Spark, Pandas, Tableau
- label: Version Control & Containers
details: Git, Docker
education:
- institution: University of Adelaide
area: Machine Learning
degree: Master
location: Adelaide, SA
start_date: 2021-06
end_date: 2023-05
- institution: Shenzhen University
area: Mathematical Sciences
degree: Bachelor
location: Shenzhen, China
start_date: 2010-09
end_date: 2014-07
languages:
- "**English**: IELTS 7.0 **Mandarin**: Native **Cantonese**: Native"
certifications:
- area: Machine Learning - https://www.coursera.org/account/accomplishments/verify/Q89DMB9RYYTB
institution: Coursera
date: 2016-10
- area: Machine Learning Nanodegree - https://graduation.udacity.com/confirm/PKAQPLU2
institution: Udacity
date: 2018-12
design:
theme: engineeringresumes
font: Charter
font_size: 10pt
page_size: letterpaper
color: black
disable_external_link_icons: true
disable_page_numbering: true
disable_last_updated_date: true
header_font_size: 25 pt
text_alignment: left-aligned
separator_between_connections: $|$
use_icons_for_connections: false
margins:
page:
top: 2 cm
bottom: 2 cm
left: 2 cm
right: 2 cm
section_title:
top: 0.3 cm
bottom: 0.2 cm
entry_area:
left_and_right: 0 cm
vertical_between: 0.2 cm
date_and_location_width: 4.5 cm
highlights_area:
top: 0.10 cm
left: 0 cm
vertical_between_bullet_points: 0.10 cm
header:
vertical_between_name_and_connections: 5 pt
bottom: 5 pt
horizontal_between_connections: 10 pt
locale_catalog:
phone_number_format: national
page_numbering_style: NAME - Page PAGE_NUMBER of TOTAL_PAGES
last_updated_date_style: Last updated in TODAY
date_style: MONTH_ABBREVIATION YEAR
month: month
months: months
year: year
years: years
present: present
to: –
abbreviations_for_months:
- Jan
- Feb
- Mar
- Apr
- May
- June
- July
- Aug
- Sept
- Oct
- Nov
- Dec
full_names_of_months:
- January
- February
- March
- April
- May
- June
- July
- August
- September
- October
- November
- December
rendercv_settings:
render_command:
output_folder_name: rendercv_output_ml
pdf_path: ted_zhao_ml_cv.pdf
dont_generate_html: false
dont_generate_markdown: false
dont_generate_png: false