Silver Tongue

allthingslinguistic:

I’m weighing in on the gif pronunciation wars at Mental Floss with the help of corpus linguistics: 

Sure, the creator of the gif, Steve Wilhite, prefers a soft g, and sure, gif originated as an acronym for graphics interchange format, but inventors aren’t always good at naming (the zipper was originally called the “clasp locker”), and acronyms aren’t always pronounced like their roots (the “a” in NATO isn’t the same as the “a” in Atlantic). In truth, language is far more democratic.

So Michael Dow, a linguistics professor at Université de Montréal, decided to investigate a different way, and I talked with him about his findings. The idea is, people decide how to pronounce a new word based on its resemblance to words they’re already familiar with. So we can all agree on how to pronounce snapchat because it’s made up of familiar words snap and chat, and we don’t have any problems with blog because it rhymes with frog, log, slog, and so on, but we have no idea how to pronounce doge because there aren’t any other common English words that end in -oge.

The problem with gif isn’t the back half—we already know how to pronounce if. The problem is the front half: Does the i make the g soft or not? It’s clearly not an absolute yes or no—there are English words in both categories: gift has a hard g before i, whereas gin has a soft g before i. What matters is the frequency. So Dow looked at a large corpus of 40,000 unique words with their frequency and pronunciation taken from The English Lexicon Project. Of these words, how many were like gift (hard g) and how many were like gin (soft g)?

[Read the rest.]

  1. rugessnome reblogged this from esoanem
  2. rubyred21 reblogged this from ben-wisehart
  3. ben-wisehart reblogged this from esoanem
  4. myroomismytardis reblogged this from linguisticparadox
  5. mikoyanstan reblogged this from esoanem
  6. harlequindom reblogged this from esoanem
  7. piece-ofmindd reblogged this from esoanem
  8. thebusylilbee reblogged this from datasoong47
  9. esoanem reblogged this from datasoong47 and added:
    If you meet an unfamiliar word in English though, it’s most likely Romance, Greek, or Latin in origin, and loans of...
  10. readinglist32 reblogged this from datasoong47
  11. anytownrory reblogged this from linguisticparadox
  12. startledoctopus reblogged this from datasoong47
  13. jewllyfish reblogged this from linguisticparadox
  14. linguisticparadox reblogged this from datasoong47
  15. datasoong47 reblogged this from linguisticparadox and added:
    Okay, that’s really interesting. My intuitive feel is that soft g is the “default” for the sequence gi, with hard g...
  16. sostanotes reblogged this from linguisticparadox
  17. esermilapow reblogged this from allthingslinguistic
  18. amillioninprizes reblogged this from ghostcat3000
  19. ghostcat3000 reblogged this from deservingporcupine
  20. stressed-coral reblogged this from big-collection-of-chaos
  21. dinosaurrainbowstarfish reblogged this from flyingfish1
  22. space-between-the-sea reblogged this from flyingfish1
  23. flyingfish1 reblogged this from allthingslinguistic
  24. allthingslinguistic posted this