The Spanish-speaking world is not one Spanish
Linguists describe the major Spanish dialect zones as Castilian (Spain, mostly), Mexican, Caribbean (Cuba, Puerto Rico, Dominican Republic), Andean (Peru, Bolivia, Ecuador), Rioplatense (Argentina, Uruguay, Paraguay), Chilean, and Andalusian. These differ in pronunciation, grammar, and vocabulary in ways that matter for transcription.
Seseo, distinción, and ceceo
One of the most distinctive splits. Castilian Spanish in northern Spain distinguishes the /θ/ sound (like English "th" in "think") from /s/, so "casar" (to marry) and "cazar" (to hunt) sound different. Most of Latin America uses seseo: both spelled differently but pronounced the same as /s/. Some Andalusian speakers use ceceo: both pronounced as /θ/. The spelling stays the same in all three cases, which is why Whisper transcripts read as "standard" Spanish regardless of the pronunciation variety.
Voseo and ustedeo
The pronoun and verb system for "you (singular informal)" varies. Castilian and most Latin American Spanish uses tú (tuteo): "tú tienes", "tú hablas". Rioplatense uses vos (voseo): "vos tenés", "vos hablás", with distinct verb endings. Central American Spanish uses both, sometimes for different registers. Colombian Spanish uses usted in some informal contexts where Castilian would use tú (ustedeo). Whisper transcribes whichever form the speaker used.
Vosotros vs ustedes
For "you (plural)", Castilian Spanish uses vosotros (with its own verb conjugations: "vosotros tenéis"). All of Latin America uses ustedes for both formal and informal plural ("ustedes tienen"). Distinctive enough that a transcript using "vosotros tenéis" is almost certainly from a Spanish speaker, while "ustedes tienen" could be from anywhere in the Americas.
Regional vocabulary that differs notably
Computer: ordenador (Spain) vs computadora (most Latin America). Bus: autobús, camión (Mexico), guagua (Cuba, Canary Islands), colectivo (Argentina), micro (Chile). Potato: patata (Spain) vs papa (most Latin America). Car: coche (Spain) vs carro (most Latin America) vs auto (Argentina, Chile). Phone: móvil (Spain) vs celular (most Latin America). Cool: guay (Spain) vs chido (Mexico) vs bacán (Chile, Peru, Cuba) vs piola (Argentina). The transcript reflects what the speaker said; if you need consistency across sources, normalise in the editor.
Code-switching with English (Spanglish)
US Hispanic and border-region Spanish often mixes English words and phrases freely. Whisper handles code-switching reasonably well within a single utterance, transcribing English words as English and Spanish words as Spanish. Long stretches of one language followed by the other work best when the language picker is set to the dominant one.