Recently there have been many works applying diffusion models to NLP domain, they majorly take two approaches, either they alter the diffusion process a little to make the noising and denoising steps possible for discrete data, or the second approach is to convert the discrete text data into a continuous format (through embeddings).
This Blog is written on Medium, view it here Continuous Diffusion Models for NLP