Git based Version Control File System for joint management of code, data, model and their relationship.
-
Updated
Nov 1, 2025 - TypeScript
Git based Version Control File System for joint management of code, data, model and their relationship.
A Simple Image Clustering Script using CLIP and Hierarchial Clustering
A tool to streamline AI image captioning
code for generating a high-quality knowledge graph with metadata about datasets and links to publications
A free and opensource yolov8-yolov11 all in one training tool that automates file structure and yaml files, auto labeling with SAM2, brush system for uninterupted labeling, a strong modular augmentation system where anybody can write their own filters and training. Without having to open terminal.
A resource for biomedical students and researchers. Includes proteomics software tools like FragPipe, MaxQuant, PDV, SearchGUI, ThermoRawFileParser, and PeptideShaker. Offers a user-friendly interface, automated identification and quantification, comprehensive data analysis, and lightweight clone feature for optimized storage.
Pluk is a simple dataset management system which stores data in chunks and a virtual filesystem in DB. Also includes kdataset CLI tool
PixelPruner Gradio is a user-friendly image cropping & dataset management app. It supports PNG, JPG, JPEG, GIF, BMP, and TIFF formats. Easily crop, preview, and manage images with interactive previews, thumbnail views, and Zip packaging. Streamline your workflow and achieve perfect crops every time with PixelPruner.
Roboflow-lite alternative: a local-first, open-source MLOps toolkit for building and training computer-vision models.
This is the 'data.aykhan.net' repository, serving as a dedicated static data API. It offers structured endpoints for user profiles, product details, events, and more, simplifying data access for web and software projects. Explore and integrate reliable static data into your applications with ease.
HuggingFace Datasets for Elixir - A native Elixir port of the popular HuggingFace datasets library. Stream, load, and process ML datasets from the HuggingFace Hub with full BEAM/OTP integration. Supports Parquet streaming, dataset splitting, shuffling, and seamless integration with Nx tensors for machine learning workflows.
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
A modular research framework engineered to benchmark CNN models across multiple sign language datasets. Featuring a scalable architecture (Factory Pattern), optimized HSV-based hand segmentation, and real-time inference capabilities for edge deployment.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Développer une application web interactive permettant à l’utilisateur de créer et gérer des datasets d’images (ex. « chat » ou « chien ») et de tester un modèle de prédiction simulé.
Crush.js is a dataset utility library
Simple project that extract, clean and process a dataset and import the data to a nosql database. Implementation of a simple app to work with.
A Deep Learning Python Toolkit for Healthcare Applications.
Add a description, image, and links to the dataset-management topic page so that developers can more easily learn about it.
To associate your repository with the dataset-management topic, visit your repo's landing page and select "manage topics."