print("Hello, I am Yingfan!")

> Hello, I am Yingfan!

Welcome to my page! You must be wondering who I am?

Well let me give you a brief introduction. My name is Yingfan (Eric) Chen and I am currently a senior at the University of Washington -- Seattle. I am finishing up my bachelor's degree in Computer Science at the Paul G. Allen School of Computer Science and Engineering in June 2023.

Résumé

You can see my full resume above, but here is a quick summary of the tech skills I have acquired through previous internships and coursework.

Programming Languages

  • Java
  • Python
  • Javascript
  • Scala
  • SQL
  • C/C++

Frameworks / Tools

  • SpringBoot
  • PySpark
  • NodeJS
  • Database (MySQL, MS SQLServer, H2, Presto, Vertica, ClickHouse)
  • Git
  • Docker
  • Kubernetes/Openshift
  • DevOps CI/CD
  • LaTeX

Work Experiences

Here is a brief summary of my previous internships. Please feel free to reach out to me if you are curious about my internship experiences.

Google

September 2022 - December 2022 | Software Engineering Intern

What is Google?
Yo! I am sure you know what is Google :D

What am I going to do?
  • I will be working in Google Photos Team under Platforms & Ecosystems.

Palo Alto Networks

June 2022 - September 2022 | Big Data Software Engineering Intern

What is Palo Alto Networks(PANW)?
Palo Alto Networks is an American multinational cybersecurity company with headquarters in Santa Clara, California. Its core products are a platform that includes advanced firewalls and cloud-based offerings that extend those firewalls to cover other aspects of security. During my internship, I worked as a Big Data Software Engineer Intern in the Big Data Team in Palo Alto Networks. IT.

What did I do?
  • Data Management: Built distributed in-memory applications using Spark (PySpark and Spark Scala API) and used YARN as resource manager to increase analytics efficiency on huge data sets by 40%. Conducted data profiling and data visualization to help make business decisions.
  • Data monitoring: Built a framework to retrieve data from two data sources (Google Sheet & Big Query Dataset) and alert the business regarding the difference after comparison, improved the data pipeline feedback by 60%.
  • Data Streaming: Utilized Kafka and Spark-Streaming to gain real-time streaming data in HDFS from multiple data sources for analysis, improving the pipeline efficiency by 30%.

Costco IT

Jan 2022 - April 2022 | Software Engineering Intern

What is Costco?
Costco Wholesale Corporation is an American multinational corporation which operates a chain of membership-only big-box retail stores. During my internship, I worked as a Software Engineer Intern in the Enterprise Service Development Team in Costco IT.

What did I do?
  • RESTful Microservices Development: Designed and developed RESTful Microservices using SpringBoot framework, integrated with Junit/SOAtest for service functionality. Implemented Docker container to create service image on Quay, and dynamically provision slaves to Jenkins CI/CD pipelines.
  • Service Deployment: Created projects/configMap/secrets on Openshift for deployment on Dev/Test/Prod environments. Troubleshot pods through ssh and logs. Worked on developing ADO pipeline for automatic deployment and configuration of RHEL servers for test and prod environments.

Tesla

July 2021 - September 2021 | Software Engineering Intern

What is Tesla?
Unless you have been living in isolation without access to the internet for the past decade, I don't think there needs to be much explanation here. During my internship, I worked as a Software Engineer Intern in the Big Data Team in Tesla.

What did I do?
  • Data Visualization: Developed real-time production line dashboards using Dash & Echarts framework in Python that triggers synchronous data fetching from databases(Presto + Vertica), and instant front-end display. Reduced the onsite feedback cycle from ten minutes to almost real-time.
  • Data Warehousing: Built and deployed high-availability ClickHouse cluster using Zookeeper on Linux servers. Conducted Kafka injection into ClickHouse for stream-data processing and multi-platform data migration. Implemented the distributed database management system demo as the proof of concept using Docker (github/yingfc/clickhouse_cluster).
  • Dev Environment: Combined python dependencies of dashboard projects into an all-in-one customized artifact to standardize dashboard development for higher efficiency, and organized ClickHouse cluster environment properties on Docker and AWS EC2.

UW Sensors, Energy, and Automation Laboratory (SEAL)

March 2021 - May 2021 | Software Developer

What is SEAL?
SEAL is University research laboratory located in the University of Washington Department of Electrical & Computer Engineering. Projects at SEAL are focused on sensor and sensor array design using electric, magnetic, acoustic, and thermal fields. SEAL focuses on developing new products in the area of medical and green energy technologies. The lab performs R&D and P&D by working in collaboration with the NIH, the NSF and other research organizations introducing cutting edge technology to the market.

What did I do?
I worked in the UW SEAL as an App/Web developer, responsible for the following to offer tutorials relating to development using Flutter.
  • Code environment setup: Set up Flutter development environment as project skeleton for three teams. Offered tutorials for Flutter development, including basic syntax and app interactivity.
  • Project DevOps setup: Built three DevOps CI/CD pipelines for project development, standardized auto building and testing to improve development efficiency.

Momenta.ai

July 2020 - October 2020 | Software Engineering Intern

What is Momenta.ai
Momenta is a leading autonomous driving technology company. It is dedicated to reshaping the future of mobility by offering solutions to enable multiple levels of driving autonomy. It has pioneered a unique scalable path toward fully autonomous driving by combining a data-driven approach with iterating algorithms and focusing on both Mpilot, its mass-production-ready highly autonomous driving solutions, and MSD (Momenta Self-Driving), its driving solution targeting full autonomy.

What did I do?
I worked as a Software Engineer Intern in the department of HD map at Momenta.ai, working on map data compilation and path planning.
  • HD Map Data compiler: Developed a High-Definition map data pipeline with functional programming by Scala, processing map data from raw reads to organized data via topology reorganization and semantic computation.
  • Map Data compilation: Implemented topological filtering method to cluster map data via features, enabled visual debugging and reduced the dependency on upstream data accuracy, reduced compilation time of 200km of high-way raw data from 40 minutes to 20 minutes.
  • DevOps Workflow Optimization: Created and maintained the Azure DevOps CI/CD pipeline for feature development, and set up the Staging-Prod pipeline triggers to enable auto-tagging for release and auto-deployment on Kubernetes to improve project deployment efficiency by 40%.

As a Learner...

Self-learnings

I am constantly trying to acquire more knowledge about the technology we use today. Whether it is inside a lecture hall or educating myself through google, I find a great curiosity in the software that surrounds us. To keep myself motivated, I try to learn teach myself something new every month on top of the learnings I acquire through coursework.

Current Topic:
Distributed Systems

Coursework

Here are all of the CSE courses I have taken at the University of Washington -- Seattle. Feel free to reach out to me if you have any questions about the courses I have taken or need any advice on any particular course.

Currently Enrolled
Name Course ID Description
Database Systems Internals CSE 444 This senior-level course will cover the relational data model and the SQL query language. Conceptual modeling: entity/relationships, normal forms. XML, XPath, and XQuery. Transactions: recovery and concurrency control. Implementation of a database system. A medium sized project using a rational database backend.
Computer Communication Networks CSE 461 This senior-level course will cover computer network architectures, protocol layers, network programming. Transmission media, encoding systems, switching, multiple access arbitration. Network routing, congestion control, flow control. Transport protocols, real-time, multicast, network security.
Computer Science Classes
Name Course ID Time Taken
Distributed Systems CSE 452 Winter 2022
Intro to Artificial Intelligence CSE 473 Autumn 2021
Machine Learning CSE 446 Autumn 2021
Intro to Algorithms CSE 421 Autumn 2021
Foundations of Computing II CSE 312 Summer 2021
Data Structures and Parallelism CSE 332 Spring 2021
System Programming CSE 333 Spring 2021
Foundations of Computing I CSE 311 Spring 2021
Software Design and Implementation CSE 331 Winter 2021
System and Software Tools CSE 391 Autumn 2020
Introduction to Database Systems CSE 414 Autumn 2020
The Hardware/Software Interface CSE 351 Autumn 2020
Web Programming CSE 154 Autumn 2020
Intermediate Programming Concepts and Tools CSE 374 Summer 2020
Data Structures and Algorithms CSE 373 Spring 2020
Intro to Data Science CSE 180 Spring 2020
Computer Programming II (Intro to Java II) CSE 143 Winter 2020
Computer Programming I (Intro to Java I) CSE 142 Autumn 2019
Math Courses
Name Course ID Description Time Taken
Advanced Multivariable Calculus MATH 324 Topics include double and triple integrals, the chain rule, vector fields, line and surface integrals. Culminates in the theorems of Green and Stokes, along with the Divergence Theorem. Summer 2020
Elements of Statistical Methods STAT 311 Elements of good study design. Descriptive statistics including correlation and regression. Introductory concepts of probability and sampling; binomial and normal distributions. Basic concepts of hypothesis testing, estimation, and confidence intervals; t-tests and chi-square tests. Spring 2020
Beginning Scientific Computing AMATH 301 Introduction to the use of computers to solve problems arising in the physical, biological, and engineering sciences. Application of mathematical judgment, programming architecture, and flow control in solving scientific problems. Introduction to MATLAB or Python routines for numerical programming, computation, and visualization. Spring 2020
Intro to Differential Equations MATH 307 Introductory course in ordinary differential equations. Includes first- and second-order equations and Laplace transform. Winter 2020
Calculus III MATH 126 Introduction to Taylor polynomials and Taylor series, vector geometry in three dimensions, introduction to multivariable differential calculus, double integrals in Cartesian and polar coordinates. Winter 2020

Connect With Me!

Click on the icons to get redirected to my other pages! Please feel free to check me out on other social media platforms or send me an email if you would like to get in touch with me.