    Repositories list

    • llmxcpg

      Public
      Source code for LLMxCPG paper
      Jupyter Notebook
      179810 Updated Dec 22, 2025
    • F5-TTS

      Public
      Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
      Python
      2k000 Updated Dec 21, 2025
    • TeX
      0000 Updated Dec 17, 2025
    • This repository provides all datasets released with the EMNLP 2025 paper Advancing Arabic Diacritization: Improved Datasets, Benchmarking, and State-of-the-Art Models.
      Java
      1100 Updated Nov 27, 2025
    • This repository provides all data and supporting material released with the EMNLP 2025 paper AraSafe: Benchmarking Safety in Arabic LLMs.
      0200 Updated Nov 27, 2025
    • ALT research group publications
      TeX
      13301 Updated Nov 23, 2025
    • RetClean

      Public
      AI for Data Repair
      JavaScript
      0900 Updated Nov 12, 2025
    • Python
      0000 Updated Nov 10, 2025
    • R
      5701 Updated Oct 27, 2025
    • 0000 Updated Oct 27, 2025
    • darepo1

      Public
      0000 Updated Oct 22, 2025
    • azerg

      Public
      Artifacts for our paper: From Text to Actionable Intelligence: Automating STIX Entity and Relationship Extraction
      Python
      0500 Updated Oct 14, 2025
    • Python
      0000 Updated Oct 11, 2025
    • DialG2P

      Public
      DialG2P: Dialectal Grapheme-to-Phoneme
      0000 Updated Sep 22, 2025
    • 0000 Updated Sep 16, 2025
    • This repository contains the data relevant to Subtasks 1A, 1B, and 1C of the IslamicEval 2025 Shared Task
      Python
      1400 Updated Aug 13, 2025
    • TechniqueRAG: Retrieval Augmented Generation for Adversarial Technique Annotation in Cyber Threat Intelligence Text
      Python
      2800 Updated Jul 9, 2025
    • Python
      2100 Updated Jul 1, 2025
    • LLMeBench

      Public
      Benchmarking Large Language Models
      Python
      21104202 Updated Jun 20, 2025
    • A Survey on Multimodal Retrieval-Augmented Generation
      22000 Updated May 17, 2025
    • c4-qcri

      Public
      Replicate the C4 dataset process without Apache Beam, using Slurm instead
      Python
      1000 Updated Aug 26, 2024
    • CMDL

      Public
      Cross-Modal Data Discovery over Structured and Unstructured Data Lakes
      Jupyter Notebook
      2900 Updated Aug 21, 2024
    • Text2TTP

      Public
      A Tool for Semantic Ranking for Automated Adversarial Technique Annotation in Security Text
      Jupyter Notebook
      0710 Updated Jul 3, 2024
    • apihub

      Public
      serve and publish API
      Python
      1202 Updated Jun 2, 2024
    • Code associated with the ACL24 paper titled, "Exploring Alignment in Shared Cross-Lingual Spaces"
      Jupyter Notebook
      0600 Updated May 23, 2024
    • PFD_Demo

      Public
      Python
      1301 Updated Mar 25, 2024
    • A framework for few-shot evaluation of language models.
      Python
      2.9k000 Updated Mar 20, 2024
    • C++
      1001 Updated Feb 2, 2024
    • The code repository of "Scaling up Discovery of Latent Concepts in Deep NLP Models", Majd Hawasly, Fahim Dalvi and Nadir Durrani, EACL 2024
      Python
      0000 Updated Jan 30, 2024
    • QCRI Generative AI Hackathon 2023 Submission Template
      6001 Updated Dec 10, 2023