The current development of educational applications for language learning has experienced a qualitative change in the criteria of interaction between users and devices due to the technological advances of input and output data through keyboard, mouse, stylus, tactile screen, etc. The multiple interactions generated in a natural way by humans during ordinary communication can be transferred in a sequential way to devices like PDAs, PC Tablet, etc. depending on the users' needs to carry out specific tasks that allow humans to adapt to their nearest learning context. This paper shows the possibility of establishing multimodal architectures within the applications for specific language learing areas with ubiquitous devices, evidencing the technical and formal aspects necessary for their accomplishment that are currently being developed at the Universidad Politécnica de Valencia (Spain).