Skip to content
View 2023lic14's full-sized avatar

Block or report 2023lic14

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
2023lic14/README.md

Claire Li banner

Claire Li

๐Ÿ‘‹ Hi, I'm Claire Li!

๐ŸŽ“ Computer Science Student @ Columbia University
๐Ÿ”’ Passionate about AI/ML, NLP, and Full-Stack Development
๐ŸŒ Building secure, intelligent software to solve real-world problems


๐Ÿš€ About Me

I'm currently a CS student at Columbia with hands-on experience in machine learning, cybersecurity, and software & web development. Iโ€™ve worked on everything from training U-Net models for medical imaging to leading malware analysis investigations and building AI-enabled web apps. As is true for all my projects, I love combining technical depth with real-world impact.

Aside from CS, I love playing video games and board games, drawing, cubing, and traveling! Ask me about Tetris or my recent trip to Peru!


๐Ÿ› ๏ธ My Tech Stack

Languages:
Python | Java | C | JavaScript | TypeScript | HTML/CSS | MIPS | Shell Script/Bash

Frameworks & Tools:
React.js | Next.js | Node.js | Git | Tailwind CSS | Linux | SolidWorks | Autodesk CAD | MATLAB | Kivy | TCP Sockets

ML & Data Science:
PyTorch | Keras | Pandas | NumPy | Scikit-Learn | Matplotlib | Seaborn | Tensorflow | BERT | HuggingFace

Cybersecurity Tools:
Kali Linux | Wireshark | REMnux | Metasploit | Burp Suite | Zeek | Volatility | VirusTotal | Joe's Sandbox | WHOIS | Threatminer | AnyRun

Databases & APIs:
MongoDB | REST APIs | OpenAI API

Hardware: Arduino | Laser Cutting | 3D Printing | Soldering


๐Ÿ”ฌ Experience

๐Ÿ’ป Break Through Tech AI @ Cornell Tech

May 2025 โ€“ August 2025

  • Completed and received a certificate for Cornell University's ML Foundations course
  • Preprocessed large datasets (windsorizing, filling missing data, one-hot-encoding, n-gram creation, lemmatization, stop word removal)
  • Built and deployed classification and clustering models (KNNs, Decision Trees, Linear/Logistic Regression, CNNs etc.)
  • Employed model selection, ensemble, and tuning methods to create more refined models (Cross-validation, Grid Search, Stacking, Random Forest, Gradient Boosting)
  • Evaluated ML models' functionality by interpreting metrics and confusion matrices, AUC-ROC curves, etc.
  • Built efficient NNs trained using stochastic gradient descent (Sentiment Analysis, Image Classification)

August 2025 โ€“ May 2026

  • Will work with mentors in the industry along with a team of peers to apply ML knowledge to the real world in impactful ways (Project Pending)

๐Ÿง  Research Assistant @ University of Minnesota CMMR Lab

Jul 2024 โ€“ Dec 2024

  • Built a U-Net model using PyTorch to denoise MRI images
  • Used Matlab to preprocess and become familiar with the datasets
  • Conducted training and testing on supercomputer through VNC
  • Tools: Python, Linux, Matlab, Git, VNC

๐Ÿค– First Tech Challenge Robotics

2016 โ€“ 2023

  • Designed, prototyped, programmed, and assembled robot
  • Implemented augmented reality and AI recognition on robot using tensorflow and Vuforia
  • Utilized Java to program robot, CAD to model parts for printing and assembly
  • Competed against other teams in FTC tournaments, troubleshot errors, and drafted design reports

๐Ÿ›ก๏ธ Beaver Works Summer Institute @ MIT โ€“ Cyber Ops Class Leader

Jun 2022 โ€“ Aug 2022

  • Participated in labs, worked in fireteams, and attended lectures held by cybersecurity professionals
  • Completed execises on Hackthissite and Bandit
  • Used industry tools such as VirusTotal, Metasploit, Zeek, and Wireshark, etc.
  • As class leader, oversaw PCAP Investigation, Malware Analysis, Memory Dump and OSINT teams, facilitated communication between mentors and peers, organized class meetings, and kept things on schedule
  • Worked with OSINT team to profile Raccoon Stealer malware and presented findings to an audience for Beaver Works' Summer Showcase

๐Ÿ“Š Data Science and Artificial Intelligence Program @ Wells Fargo

Jun 2020

  • Explored and visualized datasets using Python tools such as Pandas and MatplotLib
  • Attended lectures by professionals within the Data Science industry
  • Conducted statistical modeling using Python and worked in teams to conduct research and present projects
  • Designed and pitched iSweat, an exercise monitoring app which utilizes machine learning and artificial intelligence to prioritize user health and experience

๐Ÿ’ก Projects

๐Ÿ•ต๏ธโ€โ™€๏ธ Identifying Possible Systemic Gender Discrimination Using ML (July 2025)

Problem: Gender discrimination is a prominent issue in America. What is a way to show it exists?

Solution: By training a neural network on US census data, we could identify, based on the accuracy of the model, if there were statistically significant differences between gender using demographic data that are not directly relate to gender such as education, occupation, income, etc. Then, by identifying a difference, we are able to investigate* whether or not systemic discrimination is present

This project is exploratory and not intended to assign causality. Further investigation is necessary to interpret correlations ethically and accurately.

  • Faithfully followed the procedures of the ML lifecycle
  • Preprocessed and conducted feature engineering on the 1994 US census data (windzorized all numerical columns, filled missing entries, one-hot encoded all categorical columns, removed irrelevant features, identified and addressed class imbalance, split train and test data, etc.)
  • Trained using Tensorflow Keras' Sequential model with 3 hidden layers of 32, 16, and a 8 units fine-tuned to avoid over/underfitting and uses SGD to optimize performance
  • After 100 epochs, accuracy testing resulted in a loss of 0.4567 and accuracy of 0.7775. Created confusion matrix identifying high count of Type II error

Conclusion: The model performed well, though not exceptionally, still signifying statistical correlation between demographic information and gender identity**

*It is important to conduct investigation as to which features are most strongly correlated with these results. Categories such as occupation and income may be correlated with gender in a non-discriminatory way. Next steps to do so include plotting features against the label to determine highest correlation.

**This type of ML model has the potential to be used unethically. That is not the intention of this project.

๐Ÿ”ข Neural Network From Scratch (June 2025)

Created a convolutional neural network from scratch inside kaggle using no ML libraries, only linear algebra using NumPy, Pandas, and MatPlotLib

  • Trained on the MNIST dataset, the model was able to achieve above an 80% accuracy rate, further training may allow better performance
  • Yielded similar results to a model I created using Keras on the same dataset

๐Ÿงฑ A* NxN Rubikโ€™s Cube Solver (May 2025)

After cubing for 10+ years and still not being able to achieve a sub-20 second solve on the 3x3, I began wondering if I could create a a Python-based solver using A* that would find better solves than me.

  • Implemented an A* search algorithm using a priority queue with a custom heuristic
  • Avoids redundant moves and optimizes for efficiency by using a pruning algorithm
  • Uses a max height variable that corresponds to the NxN cube's God's number (the maximum number of moves needed to solve any given cube state in the least rotations)

๐ŸŒ SNAP for Students (Feb 2025)

Problem: A study by UC Davis found that 19% to 56% of U.S. college students struggle with food insecurity. Many students would assume they donโ€™t qualify, and the NYC online SNAP application is outdated and difficult to navigate.

Solution: We created a website, SNAP for students, to make the process more digestible. Our site is a full-stack website to simplify SNAP benefits access for students

  • Built the frontend using React.js, Next.js, TypeScript, Tailwind CSS, HTML, and JavaScript
  • Designed and implemented a MongoDB-powered forum with comment-liking, upvoting/downvoting, post sorting, and reply features and utilized Node.js for backend functionality
  • Integrated an OpenAI-powered chatbot with prompt engineering and eligibility quiz to help users determine SNAP qualification

๐Ÿงพ HTTP 1.0 Web Server in C (Dec 2024)

Built a TCP-based HTTP server from scratch in C

  • Handles static + dynamic content, error handling, and logging
  • Implements directory traversal protection and IP tracking
  • Manages all memory manually using malloc() and free(), demonstrating in-depth understanding of memory allocation.
  • Runs on command line, demonstrating high level understanding of low level code, file reading/writing, TCP sockets, stdin/stdout/stderr, REST API etc.

๐Ÿ—ฃ๏ธ Semantic Role Labeling with BERT (Nov 2024)

  • Developed a deep learning model in Python using PyTorch and Hugging Face's Transformers library to perform Semantic Role Labeling
  • Fine-tuned a BERT model on OntoNotes 5.0 dataset, using contextual embeddings for sequence labeling
  • Implemented tokenization and BIO-tag alignment, ensuring accurate mapping of subwords to semantic roles
  • Trained and evaluated the model, achieving high accuracy and F1-score

๐Ÿ”ค Anagrams Finder (May 2024)

Created a Java program to find anagrams using advanced data structures

  • Built Hashmaps, Red-Black Trees, and Linked Lists from scratch to create 3 different ways to find anagrams
  • Took user input for data structure and word, and produced all anagrams from a dictionary.txt file

๐ŸŒฑ Eco Escape (Feb 2024)

Created a simple environment-themed puzzle game that combines my love for escape rooms with environmental awareness

  • Programmed using Python Kivy and hosted as a website on Google Cloud Platform with a bucket and load balancer
  • Illustrated all visual elements

๐ŸŽฐ Handheld Slot Machine (Sep 2023 - Nov 2023)

Created a fully functional handheld slot machine that had working lights, music, a gear system, and laser coin detection to be presented at Columbia's Art of Engineering fair

  • Designed, 3D-printed, and laser-cut parts
  • Wired and programmed multiple Arduino Uno R3

๐Ÿš€ Miniature Rocket (Sep 2023)

Created a launchable ~16โ€ rocket with an engine and extendable glider, designed parts using SolidWorks, 3D printed and laser cut parts

  • Launch height ~70 feet
  • Landed well with no damage to the rocket

๐Ÿ’ผ Leadership & Activities

  • ๐Ÿ“ฃ Event Coordinator โ€“ Women in Computer Science (WiCS)
  • ๐Ÿš€ Columbia Space Initiative
  • ๐Ÿ‘ฉ๐Ÿปโ€๐Ÿ’ป Columbia App Development Initiative
  • ๐ŸŽฎ Team Leader โ€“ Columbia Tetris League
  • ๐Ÿ“š Math Tutor โ€“ Reading Team Math

๐Ÿ“ซ Connect With Me

๐Ÿ“ง Email: [email protected]
๐ŸŒ GitHub: github.com/2023lic14
๐Ÿ‘ฅ LinkedIn: https://www.linkedin.com/in/claire-li-178152272/


โ˜• Fun Fact

Always down for a Maiko Matcha Soft Serve and always looking to talk about my super cool (albeit not very large) Charizard Pokรฉmon card collection!


Thanks for stopping by! ๐Ÿš€

Popular repositories Loading

  1. devfest2026 devfest2026 Public

    TypeScript 1 1

  2. hello-world hello-world Public

    yay

  3. patchwork patchwork Public

    Forked from jlord/patchwork

    All the Git-it Workshop completers!

    Rich Text Format

  4. Eco-Escape Eco-Escape Public

    DevFest 24 Hackathon

    HTML 1

  5. denoising denoising Public

    Python

  6. lab-3 lab-3 Public

    Forked from coms2132-sp25/lab-3

    Python