As a new coding tool of Versatile Video Coding, Luma Mapping with Chroma Scaling (LMCS) maps luma samples and scales chroma residuals based on the mapping function to improve video quality and compression ratio. To achieve real-time decoding and improve the throughput, a hardware decoder needs to be developed. In this paper, a high throughput LMCS hardware decoder design was proposed. Three separate modules were designed to realize the LMCS processes with two implementation schemes of the Luma Mapping module discussed and the better one used. The proposed design reached 535 MHz at 28nm technology, capable of processing videos at 4K@120fps.