Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Which KP can provide metabolic pathway data for Translator? #64

Open
karafecho opened this issue Dec 15, 2022 · 12 comments
Open

Which KP can provide metabolic pathway data for Translator? #64

karafecho opened this issue Dec 15, 2022 · 12 comments
Assignees
Labels
Hammerhead (Sprint 6) - due Oct 4 in CI This ticket will be fixed in CI by the end of Hammerhead (Sprint 6) (Oct 4) KP - data/attribute gap

Comments

@karafecho
Copy link

karafecho commented Dec 15, 2022

The QotM identified PharmGKB, RaMP DB, Rhea-DB, and KEGG Pathway DB as potential sources of metabolic pathway data.

@sierra-moxon sierra-moxon changed the title Metabolic pathway data Which KP can provide metabolic pathway data for Translator? Dec 15, 2022
@marcdubybroad
Copy link

@karafecho The Genetics KP team is in the process of promoting new data through our stack that has KEGG and WIKIPATHWAYS data. It's currently in our dev stack.
We've only loaded disease/pathway association data where the p-value is less than 0.05.
The p-values are calculated from the Magma process aggregation of the genes' p-values in the pathway for that disease/phenotype.
Example query for our dev server (https://translator.broadinstitute.org/genetics_provider/trapi/v1.3/query)

{
    "message": {
      "query_graph": {
        "nodes": {
          "n0": {
            "ids": ["KEGG:hsa05322"]
          },
          "n1": {
            "categories": ["biolink:NamedThing"]
          }
        },
        "edges": {
          "e01": {
            "subject": "n0",
            "object": "n1"
          }
        }
      }
    }
  }

returns:

                        {
                            "attribute_type_id": "biolink:p_value",
                            "original_attribute_name": "pValue",
                            "value": 0.0019390149921821016
                        }
                    ],
                    "object": "EFO:0004308",
                    "predicate": "biolink:genetic_association",
                    "subject": "KEGG:hsa05322"
                }

@karafecho
Copy link
Author

Thanks, Mark! This type of query could be very useful for Translator.

@sstemann
Copy link

@sierra-moxon @karafecho NCATS will be creating an RaMP DB KP over the summer

@karafecho
Copy link
Author

Wow! Thanks for the update, Sarah.

@andrewsu
Copy link

andrewsu commented Mar 6, 2023

@sstemann Can you put us in touch with the developers of the RaMP API? Since the API already exists, it's possible we can annotate/register it in SmartAPI and have it accessible through BTE without any further development. We were about to get on that, but have one detail we'd like to follow up on first.

@sandrine-m sandrine-m added this to the D: Fall - 2023 milestone Jul 25, 2023
@sandrine-m
Copy link

This is linked with #2

@sierra-moxon
Copy link
Member

@andrewsu @sstemann - did either of these resources get incorporated into a Translator resource or should we keep this ticket open for further work?

@andrewsu
Copy link

Service Provider provides access to a rhea API -- more info in biothings/mychem.info#37

RaMP is ongoing -- linking the currently open issue at biothings/biothings_explorer#705

@codewarrior2000
Copy link

codewarrior2000 commented May 18, 2024

MolePro is in the process of assimilating the Reactome DB, which is "an open-source, open access, manually curated and peer-reviewed pathway database".

In Reactome, the metabolic pathways are grouped under Metabolism, which has the Reactome identifier R-HSA-1430728.

@sstemann
Copy link

sstemann commented Aug 6, 2024

we deferred to BTEs RaMP integration. If any of these are targeted for this funding phase of the program can folks please add a release (guppy, hammerhead, etc) to the label? @codewarrior2000 @andrewsu

@sstemann sstemann removed this from the D: Fall - 2023 milestone Aug 6, 2024
@codewarrior2000 codewarrior2000 added the Hammerhead (Sprint 6) - due Oct 4 in CI This ticket will be fixed in CI by the end of Hammerhead (Sprint 6) (Oct 4) label Sep 1, 2024
@andrewsu
Copy link

andrewsu commented Sep 5, 2024

Given the recent emphasis on eliminating redundant ingestion of resources into Translator, I'm rethinking how we approach RaMP. TLDR: I don't think we should ingest RaMP directly, but rather focus on Lipid Maps, the only resource that isn't otherwise already present in Translator. More details below...

According to https://rampdb.nih.gov/about, RAMP is an aggregator of the following resources:

HMDB v5.0 (2021-11-17)
Reactome v87 (Dec 2023)
WikiPathways v20240210 (2024-02-10)
KEGG from HMDB (v5.0) (2021-11-17)
ChEBI Release 231 (2024-03-01)
Lipid Maps Release 2024-03-06
Rhea Release 131 (2024-01-)

My analysis of each one of those resources:

HMDB

Reactome

Wikipathways

KEGG

ChEBI

Lipid Maps

Rhea

Bottom line, the only one that isn't currently ingested into Translator is Lipid Maps. Unless there are objections, I think we should:

There also is the issue of whether the resources above have been captured as completely in the sources above as they have been in RaMP, but barring a drastic difference in completeness/accuracy, I think we should focus on improving the existing resources over adding new ones.

@Mathelab
Copy link

Mathelab commented Sep 11, 2024

@andrewsu I agree that it makes more sense to improve upon the existing resources. A couple of nuances worth mentioning:

  • RaMP does 'harmonize' metabolite names across the resources, which is actually very difficult, particularly for lipids.
  • Biochemical pathways and chemistry seem to be pulled into Translator from other sources listed above so those may not be useful. Not sure whether Classyfire is being pulled in but that ontology is very widely used in the metabo community.
  • Seems like you could pull from LIPID MAPS directly or through RaMP to take advantage of the 'harmonization' and our parsing code (which is also public).
  • RaMP pulls in ontologies from HMDB not sure whether those would be useful.

Happy to discuss more during the relay as well.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Hammerhead (Sprint 6) - due Oct 4 in CI This ticket will be fixed in CI by the end of Hammerhead (Sprint 6) (Oct 4) KP - data/attribute gap
Projects
None yet
Development

No branches or pull requests

8 participants