Skip to content

Consider JGI Dremio lakehouse for GOLD sample data #160

@turbomam

Description

@turbomam

Summary

GOLD metadata is now available through the JGI Dremio data lakehouse. This may be relevant for GOLD sample annotation workflows.

Current state

The repo includes GOLD examples (examples/gold.json) suggesting GOLD data is used.

Suggestion

Consider whether the Dremio lakehouse could serve as a data source for GOLD samples, potentially via linkml-store's Dremio adapter.

Benefits

  • Direct SQL access to GOLD biosamples
  • Pre-structured tables
  • No API pagination limits

References

  • linkml/linkml-store
  • turbomam/jgi-dremio-exploration (minimal example)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions