Surface electromyography (sEMG) signal is a physiological electrical signal produced by muscle contraction. Different gestures can be effectively recognized from the characteristics of the sEMG signal. Currently, convolutional neural networks (CNNs) have been widely used in sEMG gesture recognition systems due to their capabilities in acquiring spatial features of sEMG signals. However, these classical CNNs are inefficient in extracting temporal correlation that resides in the time serials of sEMG signals, which is definitely important for gesture recognition. To overcome such a drawback of traditional CNN-based gesture recognition methods, we propose a multichannel hybrid deep learning model for gesture recognition by combining the multichannel CNNs with a gated recurrent unit (GRU). Specifically, we use multiple CNNs to preprocess the original multichannel EMG signals in a one-by-one manner to obtain the spatial features in the current observing window. The outputs of the multiple CNNs are concatenated and fed to a temporal-feature extracting module, which is designed by cascading a GRU with an attention mechanism. Through the GRU, the temporal features of successive signal frames can be established, while the attention mechanism is introduced to further focus on the key information in recognizing the gestures, which is beneficial to improve the robustness and accuracy of the model. Experiments show that the recognition accuracy of the proposed method reaches 97.6% and 96.7% on the Ninapro DB2 and Ninapro DB5 datasets, respectively. Compared with the classical CNN method, the performance improvement is 2.9% and xx% higher than that of the traditona CNN model, respectively.