Mammals use multiple sensory cues for mother-offspring recognition. While the role of single sensory cues has been well studied, we lack information about how multiple cues produced by mothers are integrated by their offspring. Knowing that Australian sea lion (Neophoca cinerea) pups recognise their mother's calls, we first tested whether visual cues are used by pups to discriminate between conspecifics of different age classes (adult female vs pup). We then examined if adding a visual stimulus to an acoustic cue enhances vocal responsiveness of Australian sea lion pups, by presenting wild individuals with either a visual cue (female 3D-model), an acoustic cue (mother's call), or both simultaneously, and observing their reaction. We showed that visual cues can be used by pups to distinguish adult females from other individuals, however we found no enhancement effect of these cues on the response in a multimodal scenario. Audio-only cues prompted a similar reaction to audio-visual cues that was significantly stronger than pup response to visual-only cues. Our results suggest that visual cues are dominated by acoustic cues and that pups rely on the latter in mother recognition.