Croissant is a metadata format for ML-ready datasets. See:

- https://research.google/blog/croissant-a-metadata-format-for-ml-ready-datasets/
- https://github.com/mlcommons/croissant

A schema for extracting metadata from a data descriptor, manuscript, etc. would be useful.
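As a starting point for discussion, here is a minimal sketch of what such an extraction schema could look like. It is not the Croissant schema itself: the class names `ExtractedMetadata` and `FileRef`, the choice of fields, and the example values are all assumptions made for illustration. The output uses only plain schema.org `Dataset` vocabulary, which Croissant builds on; mapping to Croissant-specific fields is left open.

```python
import json
from dataclasses import dataclass, field
from typing import List


@dataclass
class FileRef:
    """One downloadable file named in the source document (hypothetical helper)."""
    content_url: str
    encoding_format: str  # MIME type, e.g. "text/csv"


@dataclass
class ExtractedMetadata:
    """Fields one might pull out of a data descriptor or manuscript (assumed set)."""
    name: str
    description: str
    license: str = ""
    url: str = ""
    creators: List[str] = field(default_factory=list)
    files: List[FileRef] = field(default_factory=list)

    def to_jsonld(self) -> dict:
        """Serialize to a schema.org Dataset record, omitting empty fields."""
        doc = {
            "@context": {"@vocab": "https://schema.org/"},
            "@type": "Dataset",
            "name": self.name,
            "description": self.description,
        }
        if self.license:
            doc["license"] = self.license
        if self.url:
            doc["url"] = self.url
        if self.creators:
            doc["creator"] = [{"@type": "Person", "name": c} for c in self.creators]
        if self.files:
            doc["distribution"] = [
                {
                    "@type": "DataDownload",
                    "contentUrl": f.content_url,
                    "encodingFormat": f.encoding_format,
                }
                for f in self.files
            ]
        return doc


if __name__ == "__main__":
    # Placeholder values only; nothing here describes a real dataset.
    record = ExtractedMetadata(
        name="example-dataset",
        description="Placeholder description taken from a data descriptor.",
        url="https://example.org/dataset",
        creators=["A. Author"],
        files=[FileRef("https://example.org/dataset/data.csv", "text/csv")],
    )
    print(json.dumps(record.to_jsonld(), indent=2))
```

Keeping the extraction target as a small typed record, separate from the serialization step, would let the same extracted fields be emitted in another format later without changing the extraction side.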