This paper aims to improve voice and gesture recognition using the Kinect system. My research is based on the Natural User Interface (NUI), in which speech and body actions serve as the interface to the PC, making interaction more convenient for users. The system consists of a voice recognition subsystem and a gesture recognition subsystem. It works in several stages: sound signal input, signal extraction, the command system, and final implementation. The system identifies each action through the camera lens so that it can capture the action and execute it. However, my system still has some shortcomings that need to be addressed. In the future, I hope to improve my system, which was originally based on a Graphical User Interface (GUI).