A lightweight, extensible, high-level, universal code parser built on top of tree-sitter
Updated Dec 2, 2021 - Python
(1) Code mining + clickable code paths in vscode + system terminal: https://github.com/qualiu/vscode-msr (2) Visual Studio (2012~2022+) Clickable terminal integration: https://github.com/qualiu/msrTools/tree/master/code/vs-conemu (3) UI helper for msr/nin: https://github.com/qualiu/msrUI
(1) Find definition + code mining + file processing via menu/mouse/terminal in vscode, or via command line outside vscode. (2) Integration with vscode, other IDEs, and the system terminal. (3) Visual Studio (like VS2022) terminal integration (clickable file paths): https://github.com/qualiu/msrTools/blob/master/code/vs-conemu/README.md
Sievio is a Python toolkit that streams GitHub, local repositories, and other text/code sources into clean JSONL corpora for LLM pre-training, fine-tuning, or RAG. It includes structure-aware chunking, robust Unicode decoding, pluggable quality and safety screening, and optional dataset card generation and deduplication support.
Analyzing and Supporting Adaptation of Online Code Examples (ICSE 2019)
Fetch all kernels written for competitions from Kaggle.