Microbial Evolution
My current research is to identify novel lineages of symbionts and understanding the molecular evolution of prokaryotes.
Computational Genomics
Baking & Black Cats Enthusiast
Computational Biologist
Specialization in Microbial Genomics
Welcome!
I'm Yumary Vasquez, a Postdoctoral Scholar at the Joint Genome Institute at Lawrence Berkeley National Laboratory.
I completed my PhD from the Quantitative and Systems Biology program at UC Merced. During my graduate education, I studied the coevolution of the nutritional symbionts with the agricultural pest, the Macrosteles quadrilineatus leafhopper.
I have a Bachelor of Science from California State University San Marcos in Biotechnology, with a minor in Computer Science. I have previous lab experience studying T-cells in obese mice & population genomics in ladybugs and parasitic wasps.
I am a passionate science communnicator.
In grad school, I was a member, treasuer, vice president and president of the RadioBio podcast.
I am the first place winner for the 2024 Berkeley Lab Research SLAM and a second place winner for the 2024 Bay Area Research SLAM. In 2025 I will be among 17 postdoctoral fellows from US DOE National Labs to compete in Washington D.C for the National Research SLAM.
General Research Interests: genomics, next generation sequencing, data science, machine learning, evolution.
My current research is to identify novel lineages of symbionts and understanding the molecular evolution of prokaryotes.
For the complete list, please check my google scholar. Email me if you have any questions about my papers.
This page is a collection of projects I have worked on, including a blog post about the work in the project.
Currently, I am working my way through the "Practical Deep Learning for Coders" course. This is a quick project that I did for the first
lesson where I adapted the code to identify bee photos compared to wasp photos. Check out the code above and watch out for more code.
I'm an avid Chinese and Korean drama watcher. When I first started grad school, I stumbled across a C-drama on Netflix, and now I mainly use Viki to watch asian dramas.
In this project, I am working on creating a recommendation system using descriptions and information from Viki dramas. Currently, this code uses TF-IDF Vectorizer from scikit-learn
in order to provide recommendations based on description, genre, or description & genre.
This is a beginning project but I plan on adding multiple features including:
1. Create a shiny app using python code (still in development)
2. Search recommendations based on genre and/or description of show
3. Show all shows that contain a certain actor/actress; as well as recommendations on other shows that are similar to the actor/actress
4. Updates to include new shows (current dataset is until May 2022)
Any suggestions to this project are welcome!
During the Summer of 2022, my husband and I talked about "what we like to do for fun." One thing we talked about is how we both love to check out breweries wherever we go.
Following this conversation, we also talked about creating a shiny map with our brewery visits. This is the beginning of that work.
This is in no way the complete list of breweries we have been to (we forgot the name of a few), but it is a starting point.
We both also have been to other breweries without each other, and I have not added those as well.
My hope for this small, for-fun project is to take in text input that will update the maps with suggestions from people who visit this site.
In 2019, I was in a semester long course at UC Merced called Interdisciplinary Computational Graduate Education (ICGE). This class was a National Research Training Program funded by the NSF.
In this class, we grouped up with other graduate students across disciplines and chose our own project. My own team had a mathmatician, a cognitive scientist, a physicist and myself (a biologist).
As a group we decide to do some twitter scraping using this TweetScraper. At first, we wanted to look at twitter discourse on the idea of climate change.
But you can imagine how many tweets revolved around climate change.... so we settled on looking at climate change tweets related to legislation. Specifically, we focused our twitter scraping on one case: Juliana vs. US
We scraped tweets between August 2015 - February 2019, and use those tweets for a sentiment analysis. We also plotted the number of tweets every month per year and connected peaks to specific climate change related events.
Our results and methods (along with our presentation) can be found on the link for this post. However, for this post I wanted to discuss a few things:
This was not my first time using a tweet scraper, but it still amazes me how easy it is to grab data from the internet just based on a hashtag.
This was my first time using data that wasn't biological, so sentiment analysis was new to me. Finally, working with people from other disciplines was a great way I was able to learn techniques from outside my field. Most of conversations today revolve around biological ideas, but during this time my conversations covered many many topics that I had never been introduced to.