Projects

These are some of the open-source projects I work on to make neural text generation technologies more readily available to developers and researchers.

OpenAI Watch
A web-based application that monitors the consistency of GPT-4 under greedy decoding (T=0) by prompting it to draw a unicorn every hour.
github
Markup-to-Image Diffusion Models with Scheduled Sampling
A learning-based system to compile presentational markup, such as LaTeX, into corresponding images.
github
Neural Linguistic Steganography
A practical linguistic steganography algorithm based on arithmetic coding and strong neural models.
github
OpenNMT
A full-service, open-source neural machine translation system. Originally developed in Lua with Systran, it has since been ported to PyTorch and TensorFlow and is maintained externally.
github
Image-to-Markup
A learning-based system to decompile an image into presentational markup. For example, we can infer the LaTeX or HTML source from a rendered image.
github
Cascaded Text Generation with Markov Transformers
A parallel, fast, autoregressive, and accurate text generation algorithm using high-order Conditional Random Fields (CRFs).
github