A multimedia playing apparatus is provided. The multimedia playing apparatus includes a memory for storing image files and multi media hyper link (MMHL) files, each MMHL file comprising timeline information, audio information, and text information; and a central processing unit (CPU) electrically connected to the memory for reading an image file from the memory, obtaining an MMHL file matched with the image file based on a name of the image file, and controlling simultaneous output of the image file and the MMHL file according to the timeline information. A multimedia playing method is also provided.