CCAiM hugging face collection 🤗
Currently in the dataset collection phase.
CCAiM aims to develop an AI-powered model for classifying clouds based on ground-level photographs. The project uses image recognition techniques to identify cloud types according to the WMO International Cloud Atlas classification.
The model will be trained to recognize the following cloud types:
- Cirrus (Ci) – Thin, wispy clouds high in the sky
- Cirrostratus (Cs) – Transparent, whitish veil clouds
- Cirrocumulus (Cc) – Small, white patchy clouds
- Altostratus (As) – Gray/blue layer clouds preceding storms
- Altocumulus (Ac) – White/gray layered clouds with shading
- Stratus (St) – Uniform gray cloud blanket
- Stratocumulus (Sc) – Low lumpy clouds with blue sky gaps
- Nimbostratus (Ns) – Dark precipitation clouds
- Cumulus (Cu) – Fluffy white clouds with flat bases
- Cumulonimbus (Cb) – Towering thunderstorm clouds
dreams
- 📷 Dataset expansion – Using contributor photographs
- 🧠 Multiple model versions:
- V1: Initial first stable model (minimal viable dataset)
- 🔍 Image classification API for integrations
- 🌐 Interactive web demo
- 📊 Model evaluation tools
now can
- collect dataset
- create first model V0.0.1
- All data exist in hugging face
- Code in this repository is licensed under the MIT License (see LICENSE).
- Photographs located in folders named clouds_<dataset_number> are licensed under CC0 1.0 Universal (Public Domain), meaning they can be used freely for any purpose, including commercial use, without attribution.
- If other datasets are added in the future, their license terms will be specified in a separate license file inside their respective folder.
All models exist in hugging face
- v_0.0.1 # first model learned by 23 photo.
- v_0.0.2 # second model learned by 42 photo
- v_0.0.3 # model learned by 88 photo
- v_0.0.4 # model learned by 165 photo
- The most helpful contribution at the moment: If you find a discrepancy between the cloud class specified in the JSON label and the actual image content, correcting it will greatly improve dataset quality.
