W3Techs, provided by Q-Success

Usage of UTF-8 broken down by content languages

This diagram shows the percentage of websites using UTF-8, broken down by content language. See the technologies overview for an explanation of the methodologies used in the surveys.

How to read the diagram:
UTF-8 is used by 98.5% of all websites whose character encoding we know.
UTF-8 is used by 99.5% of all websites whose character encoding we know and that use English as their content language.
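
The two readings above differ only in which websites are counted in the denominator. The following Python sketch illustrates that calculation under stated assumptions: the `utf8_share` function, the `Site` type and the sample list are hypothetical and made up for illustration, and they are not W3Techs data or W3Techs code. Sites whose encoding is unknown are left out of the count entirely.

```python
from typing import Optional

# (content language, detected encoding, or None when the encoding is unknown)
Site = tuple[str, Optional[str]]


def utf8_share(sites: list[Site], language: Optional[str] = None) -> float:
    """Percentage of sites using UTF-8 among sites with a known encoding.

    If `language` is given, only sites with that content language are counted.
    """
    known = [
        enc
        for lang, enc in sites
        if enc is not None and (language is None or lang == language)
    ]
    if not known:
        raise ValueError("no sites with a known encoding in this segment")
    utf8_count = sum(1 for enc in known if enc.lower() == "utf-8")
    return round(100.0 * utf8_count / len(known), 1)


# Hypothetical sample, invented for this example only.
sample: list[Site] = [
    ("English", "UTF-8"),
    ("English", "utf-8"),
    ("English", None),  # unknown encoding: excluded from every percentage
    ("Danish", "ISO-8859-1"),
    ("Danish", "UTF-8"),
]

print(utf8_share(sample))             # share across all languages
print(utf8_share(sample, "English"))  # share within one content language
```

Excluding the unknown-encoding site is what the phrase "whose character encoding we know" implies; counting it in the denominator would lower every figure.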

Content language: websites using UTF-8
Overall: 98.5%
English: 99.5%
Spanish: 99.4%
German: 98.4%
Japanese: 98.7%
French: 98.8%
Russian: 97.4%
Portuguese: 98.2%
Italian: 98.8%
Dutch, Flemish: 99.4%
Polish: 99.4%
Turkish: 98.7%
Persian: 100.0%
Chinese: 98.1%
Vietnamese: 100.0%
Indonesian: 96.0%
Czech: 98.2%
Korean: 97.8%
Ukrainian: 99.4%
Hungarian: 98.4%
Arabic: 99.3%
Romanian: 99.6%
Swedish: 98.2%
Greek: 99.6%
Hebrew: 99.4%
Danish: 95.9%
Finnish: 98.0%
Slovak: 98.7%
Thai: 99.3%
Bulgarian: 99.1%
Serbian: 99.8%
Croatian: 99.6%
Lithuanian: 99.8%
Norwegian Bokmål: 99.5%
Slovenian: 99.6%
Catalan, Valencian: 99.2%
Estonian: 99.8%
Norwegian: 98.4%
Latvian: 99.7%
Bosnian: 99.9%
Hindi: 99.7%
Azerbaijani: 100.0%
Georgian: 99.8%
Icelandic: 98.8%
Macedonian: 99.6%
Albanian: 99.8%
Bengali: 99.9%
Armenian: 100.0%
Kazakh: 99.9%
Basque: 99.0%
Malay: 99.7%
Uzbek: 99.8%
Galician: 98.5%
Kanuri: 96.3%
Mongolian: 100.0%
Northern Sami: 92.3%
Urdu: 99.7%
Norwegian Nynorsk: 100.0%
Nepali: 100.0%
Marathi: 100.0%
Belarusian: 99.1%
Tamil: 99.5%
Afrikaans: 97.8%
Faroese: 99.4%
Khmer, Cambodian: 100.0%
Sinhala, Sinhalese: 99.7%
Burmese: 99.1%
Tagalog: 100.0%
Esperanto: 99.6%
Tahitian: 100.0%
Welsh: 97.6%
Swahili: 99.0%
Telugu: 100.0%
Malayalam: 100.0%
Kirghiz, Kyrgyz: 100.0%
Sorani, Central Kurdish: 100.0%
Tajik: 100.0%
Filipino, Pilipino: 98.7%
Kannada: 100.0%
Divehi, Dhivehi, Maldivian: 100.0%
Irish: 93.6%
Gujarati: 100.0%
Luxembourgish, Letzeburgesch: 100.0%
Kurdish: 100.0%
Lao: 100.0%
Turkmen: 100.0%
Amharic: 98.9%
Bashkir: 96.3%
Pushto, Pashto: 100.0%
Maltese: 100.0%
Tatar: 100.0%
Abkhazian: 100.0%
Sanskrit: 100.0%
Panjabi, Punjabi: 100.0%
Breton: 95.7%
Kalaallisut, Greenlandic: 100.0%
Latin: 97.7%
Bambara: 100.0%
Kinyarwanda: 100.0%
Somali: 96.6%
Swiss German, Alemannic, Alsatian: 100.0%
Avestan: 100.0%
Papiamento: 100.0%
Chamorro: 100.0%
Venda: 100.0%
Tibetan: 100.0%
Malagasy: 100.0%
Odia: 100.0%
Romansh: 100.0%
Dzongkha: 100.0%
Uighur, Uyghur: 100.0%
Occitan, Provençal: 100.0%
Afar: 100.0%
Western Frisian: 100.0%
Assamese: 100.0%
Haitian, Haitian Creole: 100.0%
Asturian, Bable, Leonese, Asturleonese: 100.0%
Corsican: 92.9%
Gaelic, Scottish Gaelic: 100.0%
Sardinian: 100.0%
Sindhi: 100.0%
Maori: 100.0%
W3Techs.com, 12 February 2025
Percentages of websites using UTF-8 broken down by content languages

Technology Brief
UTF-8
Category: Character Encodings
UTF-8 (8-bit Unicode Transformation Format) is a variable-length character encoding for Unicode, which is backwards compatible with ASCII.
Website: datatracker.ietf.org/...
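
The two properties named in the brief, variable length and backwards compatibility with ASCII, can be checked directly with Python's built-in `str.encode`. This is a minimal sketch for illustration; the helper name `utf8_length` and the sample characters are choices made for this example, not part of the W3Techs report.

```python
def utf8_length(ch: str) -> int:
    """Number of bytes UTF-8 uses to encode the given text."""
    return len(ch.encode("utf-8"))


# Variable length: a code point takes between 1 and 4 bytes in UTF-8.
for ch in ["A", "é", "€", "😀"]:
    encoded = ch.encode("utf-8")
    print(f"U+{ord(ch):04X} -> {len(encoded)} byte(s): {encoded.hex(' ')}")

# Backwards compatibility: pure ASCII text produces identical bytes
# whether it is encoded as ASCII or as UTF-8.
text = "plain ASCII text"
print(text.encode("ascii") == text.encode("utf-8"))
```

The four sample characters need 1, 2, 3 and 4 bytes respectively, and the final comparison prints `True`.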

Copyright © 2009-2025 Q-Success