cv简历

My Curriculum Vitae个人简历

Basics

Name Zhaodong Liu
Label Undergraduate Student
Email zl4789@nyu.edu
Phone +86 131-5629-6623
Summary A Computer Science and Mathematics double major student at New York University Shanghai with research experience in generative retrieval and large language models.

Education

  • 2026.08 - 2027.12

    Pittsburgh, PA, USA

    Master of Science
    Carnegie Mellon University
    Computational Data Science
  • 2022.09 - 2026.05

    Shanghai, China

    Bachelor of Science
    New York University Shanghai
    Computer Science and Mathematics (Double Major), Cum Laude
    • Study-away: NYU New York, Fall 2024 and Spring 2025
    • Relevant Coursework: Multivariable Calculus, Honors Linear Algebra, Probability & Statistics, Honors Numerical Analysis, Discrete Mathematics, Analysis, Ordinary Differential Equations, Data Structures, Basic Algorithms, Operating Systems, Introduction to Databases, Machine Learning, Natural Language Processing, Stochastic Processes
    • Honors: Dean's List (Academic Years: 2022-2025), Founders' Day Award (Annual Distinguished Student Award)

Projects

  • 2025.05 - 2026.05
    New York University Shanghai
    Generative Retrieval for Multimodal Recommendation System
    • Advisor: Prof. Hongyi Wen (Department of Computer Science, Data Science and Engineering)
    • Developed a multimodal framework to predict user behavioral patterns on Amazon datasets.
    • Designed and evaluated model architectures, implemented quantization methods of VQ-VAE, RQ-VAE, OPQ, to capture discrete behavior patterns, achieving 47.2% performance improvement (from 0.0339 to 0.0499).
    • Implemented BPE-inspired partitioning tokenization, improving sequence coherence and representation quality.
  • 2024.05 - 2024.08
    New York University Shanghai
    Efficient Driving Route Optimization Considering Traffic Light Timings
    • Advisor: Prof. Zhibin Chen (Department of Computer Science, Data Science and Engineering)
    • Built a traffic-aware driving route optimization model under real-world map and constraints on SUMO platform.
    • Simulated model on real traffic signal datasets, reducing travel time compared to baseline routing algorithms.
    • Applied energy consumption models for EV and gasoline vehicles, balancing energy efficiency and travel time.
  • 2024.09 - 2024.12
    Center for Data Science, New York University
    Finetuning on Pretrained LM for Efficient Retrosynthesis of Chemical Reactions
    • Developed predictive models for chemical reaction retrosynthesis using SMILES notation.
    • Applied an LLM (LlaSMol-Llama2) for dataset reconstruction and augmentation, improving Top-1 Accuracy by 26.2% (from 0.588 to 0.742) while reducing reliance on expensive laboratory data.
    • Employed fine-tuning strategies (encoder freezing, MLP adapters, LoRA) to optimize training efficiency.
  • 2024.10 - 2024.12
    Tandon School of Engineering, New York University
    Fantasy Sports League Database
    • Designed a 13-table relational database in MySQL supporting multi-league sports (Football, Basketball, Soccer) with user authentication, draft scheduling, roster management, trading systems, and 390+ sample records.
    • Implemented ACID-compliant transaction management, automated triggers, and stored procedures to enforce referential integrity and achieve efficient query performance through proper indexing.
    • Built a Python backend for user authentication, data querying, and business logic; developed an HTML/CSS frontend for visual operations, integrating the full stack with the MySQL database.
  • 2025.03 - 2025.05
    Tandon School of Engineering, New York University
    Collaborative Piano
    • Built a networked real-time piano application in Java enabling two users to perform together, featuring synchronized audio playback, integrated chat, and session recording with precise timestamps.
    • Implemented TCP socket networking with thread concurrency and Java Sound API supporting multiple instrument timbres including synthesized waveforms and sampled piano audio.
    • Applied Graphics2D and AffineTransform for an animated metronome, integrating GUI graphics, file I/O, and socket networking into a cohesive real-time multimedia system.

Work

  • 2025.12 - 2026.04

    Shanghai, China

    Data Engineer Intern
    Data Strategy Department, McDonald's China
    Worked on building and optimizing data infrastructure to support business intelligence and analytics for McDonald's China.
    • Developed and validated scalable database schemas and enterprise-level data maps.
    • Designed a similarity recommendation algorithm based on menu composition and user historical interactions.
    • Implemented an AI agent that significantly enhanced data panel classification and visualization.

Volunteer

  • 2023.05 - 2024.08
    Treasurer
    Organized events and managed club financial operations.
    • Managed financial operations and annual budgeting for student-led educational initiatives.
    • Facilitated letter exchanges and Q&A sessions with high school students from Hunan Province.
  • 2022.09 - 2026.05
    Trombone Section Leader
    Directed the trombone section, coordinated rehearsals, and oversaw instrument storage logistics.

Awards

Skills

Programming Languages
Python
Java
C
SQL
Stata
R
Tools and Frameworks
Git
PyTorch
Pandas
NumPy
LaTeX
Markdown

Languages

Chinese
Native
English
Fluent

基本信息

Name 刘兆东
Label 本科生
Email zl4789@nyu.edu
Phone +86 131-5629-6623
Summary 上海纽约大学计算机科学与数学双专业本科生,具有生成式检索与大语言模型方向的科研经历。

教育背景

  • 2026.08 - 2027.12

    匹兹堡,宾州,美国

    理学硕士
    卡耐基梅隆大学
    计算数据科学(Master of Computational Data Science)
  • 2022.09 - 2026.05

    上海,中国

    理学学士
    上海纽约大学
    计算机科学、数学(双主修),Cum Laude(荣誉毕业)
    • 海外学习经历:纽约大学纽约校区,2024 年 9 月 - 2025 年 5 月
    • 相关课程:多元微积分、荣誉线性代数、概率与统计、荣誉数值分析、离散数学、数学分析、常微分方程、随机过程、数据结构、算法、操作系统、数据库、机器学习、自然语言处理
    • 荣誉:Dean's List 院长名单(学年:2022–2025)、Founders' Day Award(年度杰出学生荣誉奖)

科研经历

  • 2025.05 - 2026.05
    上海纽约大学
    多模态推荐系统的生成式检索
    • 导师:文弘毅教授(计算机科学系、数据科学与工程系)
    • 开发了一个多模态框架模型,用于预测亚马逊数据集上的用户行为模式。
    • 设计并评估了模型架构,实现了 VQ-VAE、RQ-VAE 和 OPQ 等量化方法,以捕捉离散行为模式,从而实现了 47.2% 的性能提升(从 0.0339 提升至 0.0499)。
    • 采用了受 BPE(字节对编码)启发的分区分词技术,从而提高了序列的连贯性和表示质量。
  • 2024.05 - 2024.08
    上海纽约大学
    考虑交通信号灯时长的高效驾驶路线优化
    • 导师:陈志斌教授(计算机科学系、数据科学与工程系)
    • 在 SUMO 平台上基于真实地图及相关约束条件,构建了一个具备交通感知功能的驾驶路线优化模型。
    • 在真实交通信号数据集上构建的模拟模型,与基线路由算法相比,能够缩短出行时间。
    • 针对电动汽车和燃油汽车的能耗特性应用相应的模型,兼顾能源效率与行驶时间的平衡。
  • 2024.09 - 2024.12
    纽约大学数据科学中心
    微调预训练语言模型,完成化学反应的逆合成高效预测
    • 导师:何河教授(纽约大学数据科学中心)
    • 利用 SMILES 标记法开发了用于化学反应逆合成的预测模型。
    • 应用了大语言模型(LlaSMol-Llama2)对数据集进行重建和扩充,使 Top-1 准确率提高了 26.2%(从 0.588 提升至 0.742),同时减少了对实验室数据的依赖。
    • 采用了精细的调优策略(Encoder 参数冻结、MLP 适配器、LoRA)以优化训练效率。
  • 2024.10 - 2024.12
    纽约大学坦顿工程学院
    虚拟体育联赛经理数据库
    • 导师:Salim Arfaoui 教授(数据库导论课程)
    • 在 MySQL 中设计了包含 13 张数据表的关系型数据库,支持多联赛(橄榄球、篮球、足球)的虚拟队伍运营,涵盖用户认证、选秀调度、阵容管理与交易系统,录入 390 余条基于现实世界的样本数据。
    • 实现了符合 ACID 规范的事务管理、自动化触发器与存储过程,通过索引优化查询性能,并强制保证所有关系间的参照完整性;构建了排名算法、赛程编排与积分系统及免签机制。
    • 以 Python 构建后端服务,实现用户认证、数据查询与业务逻辑处理;以 HTML/CSS 开发前端交互界面,支持可视化阵容管理、实时积分展示及交易操作,实现前后端与 MySQL 数据库的全链路集成。
  • 2025.03 - 2025.05
    纽约大学坦顿工程学院
    多人协作钢琴应用
    • 导师:Daniel Katz-Braunschweig 教授
    • 采用 Java 全栈开发,以 Java Swing/AWT 构建图形化钢琴键盘前端界面,以 Java Socket 服务端处理多客户端连接,实现实时音频同步、在线聊天及精确时间戳的录音回放功能。
    • 基于 TCP 套接字网络与线程并发(ExecutorService、ConcurrentHashMap)实现服务端多客户端并发管理,通过 Java Sound API 支持合成波形与真实钢琴采样等多种音色。
    • 利用 Graphics2D 与 AffineTransform 实现动态节拍器动画,将前端 GUI、服务端网络通信、文件 I/O 整合为一个完整的实时多媒体全栈系统。

工作经历

  • 2025.12 - 2026.04

    上海,中国

    数据工程师实习
    麦当劳中国 数据战略部
    为麦当劳中国的商业智能与数据分析构建并优化数据基础设施。
    • 开发并验证了可扩展的数据库模式以及企业级的数据映射,整理构建数据地图。
    • 设计了一种基于菜单构成和用户历史交互的相似性推荐算法,为活动策划、物资筹备提供参考。
    • 开发 AI Agent 应用于数据检索,显著提升了数据面板的分类和可视化效果。

学生组织与活动

  • 2023.05 - 2024.08
    财务主管
    组织活动并管理社团财务运营。
    • 负责管理学生主导的教育项目的财务运作及年度预算编制工作。
    • 举办了与湖南省高中生的信件交流活动以及问题交换解答活动。
  • 2022.09 - 2026.05
    长号声部首席
    领导长号声部,协调排练事宜,并负责乐器存放的后勤工作。

荣誉奖励

技能

编程语言
Python
Java
C
SQL
Stata
R
工具与框架
Git
PyTorch
Pandas
NumPy
LaTeX
Markdown

语言

中文
母语
英文
流利