Welcome to My Blog
#
🌟 About Me #
I am Zeren Shen, a Machine Learning Engineer with a focus on LLMs, Generative AI, and 3D Motion Analysis. My journey began with a solid academic foundation from the University of Toronto, where I pursued a Master’s in Mechanical and Industrial Engineering with an emphasis on Analytics, and the University of Waterloo, where I earned a Bachelor’s degree in Statistics and Actuarial Science with a minor in Computer Science. Over the years, I have had the privilege of applying these skills to various projects, where I’ve collaborated with talented teams to develop AI-driven solutions aimed at solving complex real-world problems.
- 📍 Location: Toronto, ON, Canada
- 🎓 Education:
- Master’s: Mechanical & Industrial Engineering (University of Toronto)
- Bachelor’s: Statistics and Actuarial Science (University of Waterloo)
- 📬 Contact:
- Email: zeren71415@gmail.com
- GitHub: Followb1ind1y
- LinkedIn: Zeren Shen
- Resume: Download my Resume
🚀 Professional Experience #
Machine Learning Engineer #
@ ThinkGenAI Lab Inc. - Toronto, Canada
May 2023 - Nov 2024
Research Assistant RAG System #
- Developed a RAG system that enables research teams to search, retrieve, and summarize newly published papers.
- Built an end-to-end pipeline for PDF ingestion, chunking, and vector embedding (Pinecone), integrating LangChain-powered ReAct agents for real-time retrieval, semantic search, and LLM-based Q&A.
- Containerized microservices with Docker, deployed a FastAPI backend for concurrent request handling and built a Discord bot with conversation summarization, memory management, and PDF export features.
LLM-Powered Dialogue Toy Robot
- Integrated Whisper-STT, GPT-3.5, and Google TTS for dialogue robot pipeline, achieving 1.2s end-to-end latency.
- Developed a high-performance FastAPI service with async/await to enable parallel voice processing and LLM inference, handling 20+ requests per minute through request batching.
- Reduced response time by 50% through optimized pipelining, leveraging audio chunk streaming and token-level generation for seamless user interaction.
Research Assistant #
@ Laboratory for Extreme Mechanics & Additive Manufacturing - Toronto, Canada
Sep 2021 - May 2022
- Implemented TensorFlow-based U-Net models for real-time X-ray image segmentation, achieving 0.93 MeanIoU in pore detection and reducing manual inspection time by 40% through automated defect-tracking algorithms.
- Optimized bubble detection latency by 30% via model quantization and multi-threaded data pipelines, deploying the system as Docker containers integrated with industrial 3D printers.
- Publication: Zhang, J., Lyu, T., Hua, Y., Shen, Z., et al. Image Segmentation for Defect Analysis in Laser Powder Bed Fusion. Integr Mater Manuf Innov 11, 418–432 (2022).
Autonomous Driving Research Intern #
@ Zhejiang University May 2018 - Aug 2018
- Led critical data collection initiatives from driving environments to support further training of computer vision models, contributing to the research in self-driving vehicle technology.
- Collected and analyzed data from over 50 driving hours, capturing 100,000+ frames of video data and 500,000+ data points for various sensor modalities, including LiDAR and GPS.
🛠️ Projects #
Medical-LLM-Fine-tuning: Fine-tunes LLaMA-3-8B on PubMedQA with QLoRA, optimized via DeepSpeed and vLLM for efficient, low-latency medical QA. Deployable via Docker for scalable clinical inference.
AgentRAG: AgentRAG is a powerful AI-driven system that integrates web crawling, scraping, vector search, and intelligent chatbot interactions to process online information.
State Farm Distracted Driver Detection: Developed advanced image classification models using ensemble methods to identify distracted driving behaviors from dashboard camera images, improving safety insights and insurance applications.
Semantic Segmentation of Aerial Imagery: Implemented semantic segmentation models to classify every pixel in aerial images into six categories, enabling precise analysis of urban landscapes and infrastructure.
Face Mask Object Detection: Built real-time object detection models to identify and classify masked and unmasked faces, addressing public health needs during the COVID-19 pandemic with live-stream and video applications.
Explore more on my GitHub!
📚 Key Skills #
Skills | |
---|---|
Core Programming | Python, C++, NumPy, Pandas, Matplotlib, Bash |
Machine Learning | TensorFlow, PyTorch, Keras, OpenCV, Transformers, GPT |
Dev Tools | Git, Docker, Jupyter, AWS, Microsoft Azure |