Watershed Bio

Watershed Bio

Biotechnology Research

Cambridge, Massachusetts 2,926 followers

Powering insights across the life sciences Learn more at watershed.bio

About us

Watershed is the complete solution for biological data analysis, discovery, and collaboration. Our platform gives every lab the power of a dedicated computational core in one integrated environment, purpose-built for biology: - Securely access, manage, and harmonize complex datasets, including public and controlled-access data. - Run and track analyses on every data type using customizable workflows, including ready-to-use AI tools, all while upholding FAIR principles. - Leverage powerful supercomputing infrastructure and resources to complete large-scale analyses in minutes instead of weeks. - Access a dedicated bioinformatics team for everything from technical support to rigorous partnership. Email contact@watershed.bio or visit our website at http://watershed.bio to learn more or schedule a live demo.

Website
https://watershed.bio
Industry
Biotechnology Research
Company size
11-50 employees
Headquarters
Cambridge, Massachusetts
Type
Privately Held
Specialties
bioinformatics, biological data analysis, drug discovery, genomics, transcriptomics, proteomics, epigenomics, sequencing data analysis, drug development, biopharma, biological research, and biomedical research

Locations

Employees at Watershed Bio

Updates

  • ✈️ Swipe for a sneak peek at our upcoming 2025 events! ✈️ Last year we had the chance to travel across the country from the American Association for Cancer Research in San Diego, BioTechX in Philadelphia and FOG (Front Line Genomics), BioIT (Cambridge Healthtech Institute) & BioProcess International right here on our doorstep in Boston. These conferences allowed us to share our advancements in high performance computing and deepen our engagement with the broader biotech community.

  • Watershed Bio reposted this

    View profile for Griffin Seidel, graphic

    Empowering Bioinformatics @ Watershed Bio

    It was great to connect with others in the industry at the #BostonEpigeneticsSociety Holiday Party at Lamplighter Brewing Co. this week! I had a great time discussing experiences in business and Bioinformatics in 2024, meeting a bunch of new faces, and having a couple of tasty beverages in the process. All around a great way to cap off such an exciting year, for myself and for the team at Watershed Bio!

    • No alternative text description for this image
  • 🧠 According to PubMed, 150 studies introducing or citing a “foundation model” have been published in 2024 alone. Before 2023, that number was <10 per year. #FoundationModels use machine learning algorithms to, often in a self-supervised manner, infer patterns and make predictions from massive, complex datasets. In our latest blog, Silvin Gol and Evelien Schaafsma, Ph.D. highlight several foundation models trained on diverse biological data types, from #transcriptomics and #genomics to biomedical language and clinical practice. They then explore how several of these models have been used in recent studies to guide research and form new hypotheses in oncology, stem cell biology, infectious disease, and beyond. https://lnkd.in/erSWECVF #spatialbiology #cancerresearch #biotech #machinelearning

    • No alternative text description for this image
  • 🏆 The Nobel Prize in Chemistry this year went to the creators of #AlphaFold: John Jumper, Demis Hassabis, and David Baker. Check out this article for examples of how AI-powered protein design is changing #biotech.

    View profile for Rowan Walrath, graphic

    Covering the business of drugmaking for Chemical & Engineering News.

    When Schrödinger launched more than 30 years ago, computational biology was a nascent technology. Schrödinger, a pioneer in the field, relied heavily on the Protein Data Bank (PDB), a collection of solved protein structures that got its start in the 1970s. “Getting a structure in the lab used to be—and still is—difficult,” says Karen Akinsanya, Schrödinger’s president of therapeutics R&D. Back then, a single protein structure would potentially comprise a person’s entire PhD or postdoctoral research, taking “years of work,” she says. The entire field of structure-based drug design took a giant leap forward with the advent of #AlphaFold, the protein prediction software that launched in 2020 and this week became a Nobel Prize–winning technology. AlphaFold, which is owned by Google’s DeepMind, built on the PDB and other protein sequence databases, has used a neural network to predict the structures of now millions of proteins. Schrödinger and its ilk use AlphaFold to help dream up drug candidates with a higher degree of specificity than what was previously possible. Akinsanya says her team is using a better understanding of how proteins fold to design, for instance, small molecules that change how those proteins interact with each other. Recently, a Schrödinger team used structure predictions of a protein encoded by the human ether-à-go-go–related—or hERG—gene, to see how 14 compounds would bind with it, then design drugs that would avoid the protein since inhibiting it can elicit severe cardiotoxic side effects (Cell 2024, DOI: 10.1016/j.cell.2023.12.034). “It’s accelerating the work of humans. There’s no doubt about that,” Akinsanya says of AlphaFold. “We are in the century of the protein.” It’s also the century of the algorithm. More in C&EN: https://lnkd.in/eBXX_D9U

    ‘We are in the century of the protein’

    ‘We are in the century of the protein’

    cen.acs.org

  • 🧬 ✂️ Up to half of the human genome is derived from #transposons – ”jumping genes” that can change their genomic location by copying or excising themselves. Since their discovery by Dr. Barbara McClintock in 1950, transposons have been harnessed as a genetic editing tool in both basic and clinical research. So how exactly do transposons move around, and how does this make them biomedically useful? Which FDA-approved therapeutics use transposon-based technologies today? Check out our latest mini review for answers to these and other questions! https://lnkd.in/gnSPDmd5 #geneediting #biotech #biopharma #genetherapy

    Transposons: Jumping from the Lab to the Clinic

    Transposons: Jumping from the Lab to the Clinic

    watershed.bio

  • Important considerations for using UCE and potentially other #foundationmodels like Geneformer and scGPT 💡

    View profile for Tyler Burns, PhD, graphic

    Single-Cell Biology Strategy Consultant & Educator | Biosecurity Researcher

    In a standard scRNA-seq analysis pipeline, you select the top ~2000 variable genes for downstream analysis (eg. clustering). However, my recent experiment suggests that you should not do this for foundation models. Here is what I did... The Universal Cell Embeddings (UCE) foundation model, part of a bigger "virtual cell" initiative, takes a raw cells x counts matrix as input and outputs a 1280 dimensional vector that contains biological meaning as output. This is then used for downstream analysis. The power here is that you get the same vectors every time. There is no fine-tuning of the model. So you can make comparisons with any datasets that have never been run through the model, and therefore do things like annotate, given metadata cells from other datasets. As I said in a previous post, this can take a long time if you're running it locally. One hypothesis, inspired by one of the comments, was that I could put in an abbreviated dataset of only variable genes, and get a faster result without sacrificing accuracy - a good thing when computational resources are limited. Experimental design: I ran the following 3 datasets through UCE. 1. The full dataset (positive control). 2. The dataset containing the most variable genes (experimental). 3. The dataset containing a random selection of genes (negative control). My results: I found that the dataset containing the most variable genes did not have the same level of cell type separation compared to the full dataset, with the negative control performing worse than both of them. This can be seen by assessing PCA space of the concatenated data (image below). Further quantification via Shannon entropy (to measure diversity) confirms this (see my jupyter notebook in the comments). What this means for you: This suggests that for UCE, and perhaps for other foundation models (geneformer, scGPT), you should run the full dataset through it to get the best results, and the typical practice of only selecting variable genes may not apply to the use of foundation models. Zooming out: There has been an uptick in people asking me questions around AI as it relates to single-cell in the past few weeks (perhaps because I'm posting about it). Even if you're a natural skeptic (like me), you should at least be familiar with them, because like the black boxes before it (eg. t-SNE/UMAP), these tools don't appear to be going anywhere. And they do indeed have potential to accelerate our workflows. If you are doing work in this space, or interested in doing work in this space, please let me know. A jupyter notebook showing my work is linked in the comments. I hope you all have a great day.

    • No alternative text description for this image
  • 💡 This new study uses single-cell resolution Imaging Mass Cytometry (IMC) to profile the cellular landscape of pancreatic ductal adenocarcinoma (#PDAC), one of the most lethal types of cancer. #PancreaticCancerAwareness

    View profile for Marco Erreni, graphic

    Imaging Technologist presso Multiscale and Nanostructural Imaging Unit, IRCCS Humanitas Research Hospital - Adjunt professor presso Humanitas University

    I am pleased to share our latest publication on pancreatic adenocarcinoma and Imaging Mass Cytometry (IMC). Using IMC at single cell resolution, we have identified 19 distinct subpopulations of cancer-associated fibroblasts that differ in phenotype, spatial localisation, interaction with immune and tumour cells, and association with patients' CA19-9 levels and prognosis. A special thanks to our collaborators from the Pathology Unit, Medical Oncology and Haematology Unit and Pancreatic Surgery Unit at the IRCCS Humanitas Research Hospital for their amazing contribution! Hope you enjoy reading! https://lnkd.in/eT-wex8z

    Frontiers | Depicting the cellular complexity of pancreatic adenocarcinoma by Imaging Mass Cytometry: focus on cancer-associated fibroblasts

    Frontiers | Depicting the cellular complexity of pancreatic adenocarcinoma by Imaging Mass Cytometry: focus on cancer-associated fibroblasts

    frontiersin.org

Similar pages