ppc64le/linux/: xgrammar-0.1.33+ppc64le1 metadata and description

Simple index

Efficient, Flexible and Portable Structured Generation

author MLC Team
classifiers
  • License :: OSI Approved :: Apache Software License
  • Development Status :: 4 - Beta
  • Intended Audience :: Developers
  • Intended Audience :: Education
  • Intended Audience :: Science/Research
  • Environment :: MetaData :: IBM Python Ecosystem
description_content_type text/markdown
keywords machine learning,inference
license Apache 2.0
project_urls
  • Homepage, https://xgrammar.mlc.ai/
  • GitHub, https://github.com/mlc-ai/xgrammar
provides_extras
  • test
  • metal
requires_dist
  • pydantic
  • torch>=1.10.0
  • transformers>=4.38.0
  • triton; platform_system == "Linux" and platform_machine == "x86_64"
  • numpy
  • typing-extensions>=4.9.0
  • huggingface-hub[cli]; extra == "test"
  • protobuf; extra == "test"
  • pytest; extra == "test"
  • sentencepiece; extra == "test"
  • tiktoken; extra == "test"
  • transformers<4.50.0; platform_system == "Darwin" and extra == "test"
  • mlx-lm; platform_system == "Darwin" and platform_machine == "arm64" and extra == "metal"
requires_python <4,>=3.8
File Tox results History
xgrammar-0.1.33+ppc64le1-cp311-cp311-manylinux_2_34_ppc64le.whl
Size
41 MB
Type
Python Wheel
Python
3.11
xgrammar-0.1.33+ppc64le1-cp312-cp312-manylinux_2_34_ppc64le.whl
Size
41 MB
Type
Python Wheel
Python
3.12

logo

Documentation License PyPI PyPI Downloads Ask DeepWiki

Efficient, Flexible and Portable Structured Generation

Get Started | Documentation | Blogpost | Technical Report

News

Overview

XGrammar is an open-source library for efficient, flexible, and portable structured generation.

It leverages constrained decoding to ensure 100% structural correctness of the output. It supports general context-free grammar to enable a broad range of structures, including JSON, regex, custom context-free grammar, etc.

XGrammar uses careful optimizations to achieve extremely low overhead in structured generation. It has achieved near-zero overhead in JSON generation, making it one of the fastest structured generation engines available.

XGrammar features universal deployment. It supports:

XGrammar is very easy to integrate with LLM inference engines. It is the default structured generation backend for most LLM inference engines, including vLLM, SGLang, TensorRT-LLM, and MLC-LLM, as well as many other companies. You can also try out their structured generation modes!

Get Started

Install XGrammar:

pip install xgrammar

For use with MPS on Apple Silicon, install with:

pip install "xgrammar[metal]"

Import XGrammar:

import xgrammar as xgr

Please visit our documentation to get started with XGrammar.

Third-Party Bindings

Collaborators

XGrammar has been widely adopted in industry, open-source projects, and academia. Our collaborators include:

WebLLM

Citation

If you find XGrammar useful in your research, please consider citing our paper:

@article{dong2024xgrammar,
  title={Xgrammar: Flexible and efficient structured generation engine for large language models},
  author={Dong, Yixin and Ruan, Charlie F and Cai, Yaxing and Lai, Ruihang and Xu, Ziyi and Zhao, Yilong and Chen, Tianqi},
  journal={Proceedings of Machine Learning and Systems 7},
  year={2024}
}

Export Classification Notice

The software hosted on this website consists of publicly available open‑source packages. To the extent U.S. export regulations apply, software that is publicly available as described in 15 C.F.R. §§ 734.7 (for non-encryption software) or 742.15(b) (for encryption software) is not subject to the Export Administration Regulations (EAR). Users are responsible for complying with all applicable export laws and regulations.