Saturday, December 1, 2007

The Greek Language

Greek (ελληνική γλώσσα IPA: [eliniˈki ˈɣlosa] or simply ελληνικά IPA: [eliniˈka], "Hellenic") has a documented history of 3,500 years, the longest of any single natural language in the Indo-European language family. It is also one of the earliest attested Indo-European languages, with fragmentary records in Mycenaean dating back to the 15th or 14th century BC, making it the world's oldest recorded living language. Today, it is spoken by approximately 15–25 million people in Greece (official), Cyprus (official), Albania, Bulgaria, the Former Yugoslav Republic of Macedonia (FYROM), Italy, Turkey, Armenia, Georgia, Ukraine, Moldova, Romania, Russia, Egypt, Jordan and emigrant communities around the world, including Australia, the United States, Canada and Germany.

Greek has been written in the Greek alphabet (the first to introduce vowels) since the 9th century BC in Greece (before that in Linear B), and the 4th century BC in Cyprus (before that in Cypriot syllabary). Greek literature has a continuous history of nearly three thousand years.

In classical Greek, as in classical Latin, only upper-case letters existed. The lower-case Greek letters were developed much later by medieval scribes to permit a faster, more convenient cursive writing style with ink and quill. The variant of the alphabet in use today is essentially the late Ionic variant, introduced for writing classical Attic in 403 BC.

The modern Greek alphabet consists of 24 letters, each with a capital (majuscule) and lowercase (minuscule) form. The letter sigma has an additional lowercase form (ς) used in final position.

Majuscule form

Α Β Γ Δ Ε Ζ Η Θ Ι Κ Λ Μ Ν Ξ Ο Π Ρ Σ Τ Υ Φ Χ Ψ Ω

Minuscule form

α β γ δ ε ζ η θ ι κ λ μ ν ξ ο π ρ σ τ υ φ χ ψ ω
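
The final-sigma rule can be illustrated with a short Python sketch. The function name `to_minuscule` and the constant names are assumptions made for this illustration; the two letter strings are the majuscule and minuscule rows of the 24-letter alphabet. The sketch treats a sigma as final only when it is the last character of the input word, so punctuation and multi-word text are not handled.

```python
# The 24 letters of the modern Greek alphabet, in matching order.
MAJUSCULE = "ΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΤΥΦΧΨΩ"
MINUSCULE = "αβγδεζηθικλμνξοπρστυφχψω"
FINAL_SIGMA = "ς"  # additional lowercase form of sigma, used in final position

# Hypothetical helper table: capital letter -> lowercase letter.
_TO_LOWER = dict(zip(MAJUSCULE, MINUSCULE))


def to_minuscule(word):
    """Lowercase a single Greek word, using the final form of sigma at the end."""
    lowered = [_TO_LOWER.get(ch, ch) for ch in word]
    if lowered and lowered[-1] == "σ":
        lowered[-1] = FINAL_SIGMA
    return "".join(lowered)


if __name__ == "__main__":
    print(to_minuscule("ΛΟΓΟΣ"))
```

A sigma in the middle of a word keeps its ordinary form (σ); only the last letter is replaced.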


In addition to the letters, the Greek alphabet also features a number of diacritical signs: three different accent marks (acute, grave and circumflex), originally denoting different shapes of pitch accent on the stressed vowel; the so-called breathing marks (spiritus asper and spiritus lenis), originally used to signal presence or absence of word-initial /h/; and the diaeresis, used to mark full syllabic value of a vowel that would otherwise be read as part of a diphthong. These marks were introduced during the course of the Hellenistic period. Actual usage of the grave in handwriting saw a rapid decline in favor of uniform usage of the acute during the late 20th century, and it was retained only in typography.

In the writing reform of 1982, most of these diacritics were abolished from official use in Greece. Since then, Modern Greek has been written mostly in the simplified monotonic orthography, which employs only the acute accent and the diaeresis. The traditional system, now called the polytonic orthography, is still used for Modern Greek in book printing and by some writers, and it is used internationally for writing Ancient Greek.
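
The difference between the two orthographies can be sketched in Python with the standard `unicodedata` module. This is a rough illustration, not an official conversion procedure: the function name `polytonic_to_monotonic` is an assumption, and so is the decision to drop the iota subscript along with the breathing marks. The sketch decomposes each letter, maps grave and circumflex to acute, removes the breathings, keeps the diaeresis, and recomposes. It does not apply the further monotonic rule that most monosyllables carry no accent at all.

```python
import unicodedata

ACUTE = "\u0301"  # combining acute accent
# Grave (U+0300) and Greek circumflex, the perispomeni (U+0342), become acute.
TO_ACUTE = {"\u0300", "\u0342"}
# Smooth breathing (U+0313) and rough breathing (U+0314) are removed.
# Assumption for this sketch: the iota subscript (U+0345) is removed as well.
DROPPED = {"\u0313", "\u0314", "\u0345"}


def polytonic_to_monotonic(text):
    """Reduce polytonic diacritics to the acute accent and the diaeresis."""
    out = []
    for ch in unicodedata.normalize("NFD", text):
        if ch in DROPPED:
            continue
        out.append(ACUTE if ch in TO_ACUTE else ch)
    return unicodedata.normalize("NFC", "".join(out))


if __name__ == "__main__":
    # Alpha with smooth breathing and acute at the start of the word.
    print(polytonic_to_monotonic("\u1f04νθρωπος"))
```

Working on the decomposed (NFD) form means each diacritic is a separate combining character, so the mapping is a simple per-character filter.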


The Malay language

The Malay language (ISO 639-1 code: ms; Malay: Bahasa Melayu; Jawi script: بهاس ملايو) is an Austronesian language spoken by the Malay people who reside in the Malay Peninsula, southern Thailand, the Philippines, Singapore, central eastern Sumatra, the Riau islands, parts of the coast of Borneo and even in the Netherlands. It is an official language of Malaysia, Brunei and Singapore. It is very similar to Indonesian, known locally as Bahasa Indonesia, the official language of Indonesia.

In Malaysia, the language is officially known as Bahasa Malaysia, which translates as the "Malaysian language". The term, which was introduced by the National Language Act 1967, was predominant until the 1990s, when most academics and government officials reverted to "Bahasa Melayu," which is used in the Malay version of the Federal Constitution. According to Article 152 of the Federal Constitution, Bahasa Melayu is the official language of Malaysia. "Bahasa kebangsaan" (National language) was also used at one point during the 1970s.

Indonesia adopted a form of Malay as its official language upon independence, naming it Bahasa Indonesia. Although a large degree of mutual intelligibility exists, Indonesian is distinct in many ways from Malay as spoken in Malaysia. In Singapore and Brunei it is known simply as Malay or Bahasa Melayu. "Bahasa Melayu" is specified as Brunei's official language by the country's 1959 Constitution.

However, many Malay dialects are not as mutually intelligible: for example, Kelantanese pronunciation is difficult even for some Malaysians to understand, while Indonesian tends to have a lot of words unique to it which will be unfamiliar to other speakers of Malay.

The language spoken by the Peranakan (Straits Chinese, a hybrid of Chinese settlers from the Ming Dynasty and local Malays) is a unique patois of Malay and the Chinese dialect of Hokkien, which is mostly spoken in the former Straits Settlements of Penang and Malacca.

Writing system

Malay is normally written using the Latin alphabet, called Rumi, although a modified Arabic script called Jawi also exists. Efforts are currently being undertaken to preserve the Jawi script and to revive its use amongst Malays in Malaysia, and students taking Malay language examinations in Malaysia have the option of answering questions in Jawi. The Latin alphabet, however, is still the most commonly used script in Malaysia, both for official and informal purposes.

Historically, the Malay language has been written in various scripts. Before the introduction of the Arabic script in the Malay region, Malay was written using the Pallava, Kawi and Rencong scripts. Old Malay was written using the Pallava and Kawi scripts, as is evident from several inscription stones in the Malay region. Starting from the era of the kingdom of Pasai and throughout the golden age of the Sultanate of Malacca, Jawi gradually replaced these scripts as the most commonly used script in the Malay region.

The English Language


English is the first language of most people in Canada, the Commonwealth Caribbean, Ireland, New Zealand, the United Kingdom and the United States of America (also commonly known as the Anglosphere). It is used extensively as a second language and as an official language throughout the world, especially in Commonwealth countries and in many international organisations.

Modern English is sometimes described as the global lingua franca. English is the dominant international language in communications, science, business, aviation, entertainment, radio and diplomacy. The influence of the British Empire is the primary reason for the initial spread of the language far beyond the British Isles. Following World War II, the growing economic and cultural influence of the United States has significantly accelerated the adoption of English.

A working knowledge of English is required in certain fields, professions, and occupations. As a result, over a billion people speak English at least at a basic level. English is one of six official languages of the United Nations.

Number of words in English

English has an extraordinarily rich vocabulary and a willingness to absorb new words. As the General Explanations at the beginning of the Oxford English Dictionary states: "The Vocabulary of a widely diffused and highly cultivated living language is not a fixed quantity circumscribed by definite limits... there is absolutely no defining line in any direction: the circle of the English language has a well-defined centre but no discernible circumference."


The vocabulary of English is undoubtedly vast, but assigning a specific number to its size is more a matter of definition than of calculation. Unlike some other languages, English has no Academy to define officially accepted words. Neologisms are coined regularly in medicine, science, technology and other fields, and new slang is constantly developed. Some of these new words enter wide usage; others remain restricted to small circles. Foreign words used in immigrant communities often make their way into wider English usage. Archaic, dialectal, and regional words might or might not be widely considered "English".

The Oxford English Dictionary, 2nd edition (OED2) includes over 600,000 definitions, following a rather inclusive policy: "It embraces not only the standard language of literature and conversation, whether current at the moment, or obsolete, or archaic, but also the main technical vocabulary, and a large measure of dialectal usage and slang" (Supplement to the OED, 1933).


The editors of Webster's Third New International Dictionary, Unabridged (475,000 main headwords), estimate in their preface that the number is much higher. It is estimated that about 25,000 words are added to the language each year.

The Japanese Language


Japanese (日本語 / にほんご Nihongo) is a language spoken by over 130 million people, in Japan and in Japanese emigrant communities around the world. It is an agglutinative language and is distinguished by a complex system of honorifics reflecting the hierarchical nature of Japanese society, with verb forms and particular vocabulary to indicate the relative status of speaker, listener and the person mentioned in conversation. The sound inventory of Japanese is relatively small, and has a lexically distinct pitch-accent system. Early Japanese is known largely on the basis of its state in the 8th century, when the three major works of Old Japanese were compiled; but smaller amounts of material, primarily inscriptional, are older. The earliest attestation of Japanese is in a Chinese document from 252 A.D.

The Japanese language is written with a combination of three different types of scripts: modified Chinese characters called kanji (漢字 かんじ), and two syllabic scripts, hiragana (平仮名 ひらがな) and katakana (片仮名 カタカナ). The Latin alphabet, rōmaji, is also often used in modern Japanese, especially for company names and logos, advertising, and when inputting Japanese into a computer. Western style Arabic numerals are generally used for numbers, but traditional Sino-Japanese numerals are also commonplace.
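
The mix of scripts in a piece of Japanese text can be made visible with a small Python sketch that sorts characters by Unicode block. The function names `script_of` and `scripts_in` are assumptions made for this illustration, and the ranges are a simplification: only the main Hiragana, Katakana and CJK Unified Ideographs blocks are checked, and any ASCII letter is counted as rōmaji.

```python
def script_of(ch):
    """Classify one character by its Unicode block (simplified ranges)."""
    code = ord(ch)
    if 0x3040 <= code <= 0x309F:   # Hiragana block
        return "hiragana"
    if 0x30A0 <= code <= 0x30FF:   # Katakana block
        return "katakana"
    if 0x4E00 <= code <= 0x9FFF:   # CJK Unified Ideographs block
        return "kanji"
    if ch.isascii() and ch.isalpha():
        return "romaji"
    return "other"


def scripts_in(text):
    """Return the distinct scripts in text, in order of first appearance."""
    seen = []
    for ch in text:
        script = script_of(ch)
        if script not in seen:
            seen.append(script)
    return seen


if __name__ == "__main__":
    print(scripts_in("漢字かんじ"))
    print(scripts_in("片仮名カタカナ"))
```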

Japanese vocabulary has been heavily influenced by loans from other languages. A vast number of words were borrowed from Chinese, or created from Chinese models, over a period of at least 1,500 years. Since the late 19th century, Japanese has borrowed a considerable number of words from Indo-European languages, primarily English. Because of the special trade relationship between Japan and Holland in the 17th century, Dutch has also been influential, with words like bīru (from bier; "beer") and kōhī (from koffie; "coffee").

The Chinese Language


Chinese or the Sinitic language(s) (汉语/漢語, Pinyin: Hànyǔ; 华语/華語, Huáyǔ; or 中文, Zhōngwén) can be considered a language or language family. Originally the indigenous languages spoken by the Han Chinese in China, they form one of the two branches of the Sino-Tibetan family of languages. About one-sixth of the world’s population, or over 1 billion people, speak some form of Chinese as their native language. The identification of the varieties of Chinese as "languages" or "dialects" is controversial. As a language family, Chinese has an estimated 1.2 billion speakers; Mandarin Chinese alone has around 850 million native speakers, outnumbering any other language in the world.

Spoken Chinese is distinguished by its high level of internal diversity, though all spoken varieties of Chinese are tonal and analytic. There are between six and twelve main regional groups of Chinese (depending on classification scheme), of which the most populous (by far) is Mandarin (c. 850 million), followed by Wu (c. 90 million), Min (c. 70 million) and Cantonese (c. 70 million). Most of these groups are mutually unintelligible, though some, like Xiang and the Southwest Mandarin dialects, may share common terms and some degree of intelligibility. Chinese is classified as a macrolanguage with 13 sub-languages in ISO 639-3, though the identification of the varieties of Chinese as multiple "languages" or as "dialects" of a single language is a contentious issue.

The standardized form of spoken Chinese is Standard Mandarin (Putonghua/Guoyu), based on the Beijing dialect. Standard Mandarin is the official language of the People's Republic of China, the Republic of China in Taiwan, as well as one of four official languages of Singapore. Chinese—de facto, Standard Mandarin—is one of the six official languages of the United Nations. Of the other varieties, Standard Cantonese is common and influential in Cantonese-speaking overseas communities, and remains one of the official languages of Hong Kong (together with English) and of Macau (together with Portuguese). Min Nan, part of the Min language group, is widely spoken in southern Fujian, in Taiwan (where it is known as Taiwanese or Hoklo) and in Southeast Asia (where it dominates in Singapore and Malaysia and is known as Hokkien).

The Chinese written language employs Chinese characters (漢字/汉字 pinyin: hànzì), which are logograms: each symbol represents a semanteme or morpheme (a meaningful unit of language), as well as one syllable; the written language can thus be termed a morphemo-syllabic script.

Chinese characters evolved over time from earlier hieroglyphic forms. The idea that all Chinese characters are either pictographs or ideographs is erroneous: most characters contain phonetic parts, and are composites of phonetic components and semantic radicals. Only the simplest characters, such as ren 人 (human), ri 日 (sun), shan 山 (mountain), shui 水 (water), may be wholly pictorial in origin. In 100 AD, the famed Hàn Dynasty scholar Xǔ Shèn classified characters into six categories, namely pictographs, simple ideographs, compound ideographs, phonetic loans, phonetic compounds and derivative characters. Of these, only 4% were categorized as pictographs, and 80-90% as phonetic complexes consisting of a semantic element that indicates meaning and a phonetic element that arguably once indicated the pronunciation. There are 214 radicals recognized in the Kangxi Dictionary, which indicate what the character is about semantically.

Modern characters are styled after the standard script (楷书/楷書 kǎishū) (see styles, below). Various other written styles are also used in East Asian calligraphy, including seal script (篆书/篆書 zhuànshū), cursive script (草书/草書 cǎoshū) and clerical script (隶书/隸書 lìshū). Calligraphy artists can write in traditional and simplified characters, but tend to use traditional characters for traditional art.

Various styles of Chinese calligraphy.

There are currently two systems for Chinese characters. The traditional system, still used in Hong Kong, Taiwan and Macau, takes its form from standardized character forms dating back to the late Han dynasty. The Simplified Chinese character system, developed by the People's Republic of China in 1954 to promote mass literacy, simplifies most complex traditional glyphs to fewer strokes, many to common caoshu shorthand variants, with a larger pool of synonymous characters.

Singapore, which has a large Chinese community, is the first (and at present the only) foreign nation to officially adopt simplified characters, although they have also become the de facto standard for younger ethnic Chinese in Malaysia. The Internet provides a platform for practising reading the alternative system, be it traditional or simplified.

A well-educated Chinese reader today recognizes approximately 6,000-7,000 characters; some 3,000 of them are required to read a Mainland newspaper. The PRC government defines literacy amongst workers as a knowledge of 2,000 characters, though this would amount only to functional literacy. A large unabridged dictionary like the Kangxi Dictionary contains over 40,000 characters, including obscure, variant and archaic characters; only a quarter are now commonly used.

The Hindi Language

Hindi (Devanāgarī: हिन्दी or हिंदी, IAST: Hindī, IPA: [hɪnd̪iː]), an Indo-European language spoken all over India in varying degrees and extensively in northern and central India, is one of the 22 official languages of India and is used, along with English, for central government administrative purposes. It is part of a language continuum of the Indic family, bounded on the northwest and west by Punjabi, Sindhi, and Gujarati; on the south by Marathi and Konkani; on the southeast by Oriya; on the east by Bengali; and on the north by Nepali.

More precisely, Hindi also refers to a standardized register of the Hindustani language, termed Khariboli, that emerged as the standard dialect.

Pronouns

Hindi has pronouns in the first, second and third person, with a single form for all genders. Thus, unlike English, it makes no distinction between "he" and "she". Strictly speaking, the third-person pronoun is the same as the demonstrative pronoun (this / that). The verb, upon conjugation, usually indicates the gender. The pronouns have additional accusative and genitive cases, but no vocative. There may also be two ways of inflecting the pronoun in the accusative case. Note that for the second-person pronoun (you), Hindi has three levels of honorifics:
आप (/ɑːp/): Formal and respectable form for you. Has no difference between the singular and the plural. Used in all formal settings and speaking to persons who are senior in job or age. Plural could be stressed by saying आप लोग (/ɑːp log/ you people) or आप सब (/ɑːp səb/ you all).
तुम (/t̪um/): Informal form of you. Has no difference between the singular and the plural. Used in all informal settings and speaking to persons who are junior in job or age. Plural could be stressed by saying तुम लोग (/t̪um log/ you people) or तुम सब (/t̪um səb/ you all).
तू (/t̪uː/): Extremely informal form of you, as thou. Strictly singular, its plural form being /t̪um/. Except for very close friends or poetic language involving God, it could be perceived as offensive in India.

Imperatives (requests and commands) correspond in form to the level of honorific being used, and the verb inflects to show the level of respect and politeness desired. Because imperatives can already include politeness, the word "kripayā", which can be translated as "please", is much less common than in spoken English; it is generally only used in writing or announcements, and its use in common speech may even reflect mockery.

The Tamil Language


Tamil (தமிழ் tamiḻ; IPA: [t̪ɐmɨɻ]) is a Dravidian language spoken predominantly by Tamils in India, Sri Lanka and Singapore, with smaller communities of speakers in many other countries. It is the official language of the Indian state of Tamil Nadu, is a classical language of India, and has official status in India, Sri Lanka and Singapore. With more than 77 million speakers, Tamil is one of the most widely spoken languages in the world.

Tamil has a literary tradition of over two thousand years. The earliest epigraphic records found date to around 300 BC, and the Tolkāppiyam, the oldest known literary work in Tamil, has been dated variously between the second century BC and the tenth century AD. Tamil was declared a classical language of India by the Government of India in 2004 and was the first Indian language to be accorded that status.

Tamil employs agglutinative grammar, where suffixes are used to mark noun class, number, and case, verb tense and other grammatical categories. Unlike other Dravidian languages, the metalanguage of Tamil, the language used to describe the technical linguistic terms of the language and its structure, is also Tamil (rather than Sanskrit). According to a 2001 survey, there were 1,863 newspapers published in Tamil, of which 353 were dailies.

Grammar

Much of Tamil grammar is extensively described in the oldest known grammar book for Tamil, the Tolkāppiyam. Modern Tamil writing is largely based on the 13th century grammar Naṉṉūl which restated and clarified the rules of the Tolkāppiyam, with some modifications. Traditional Tamil grammar consists of five parts, namely eḻuttu, col, poruḷ, yāppu, aṇi. Of these, the last two are mostly applied in poetry.

Similar to other Dravidian languages, Tamil is an agglutinative language. Tamil is characterised by its use of retroflex consonants, like the other Dravidian languages. It also uses a liquid l (ழ) (example Tamil), which is also found in Malayalam (example Kozhikode), but disappeared from Kannada at around 1000 AD (but present in Unicode), and was never present in Telugu. Tamil words consist of a lexical root to which one or more affixes are attached. Most Tamil affixes are suffixes. Tamil suffixes can be derivational suffixes, which either change the part of speech of the word or its meaning, or inflectional suffixes, which mark categories such as person, number, mood, tense, etc. There is no absolute limit on the length and extent of agglutination, which can lead to long words with a large number of suffixes.

Indo-Aryan migration

Indo-Aryan migration is a necessary corollary of any model of Indo-European origins that locates the original Indo-European homeland outside the Indian subcontinent.

The Indo-Aryans derive from an earlier Proto-Indo-Iranian stage, usually identified with the Bronze Age Andronovo culture at the Caspian Sea, and Indo-Aryan migration to India is consequently assumed to have taken place in the Middle to Late Bronze Age, contemporary to the Late Harappan phase in India.

The linguistic center of gravity principle states that a language family's most likely point of origin is in the area of its greatest diversity. Take, for example, the Germanic languages—of which English is one. North America may have more speakers of Germanic languages, but almost all of them are exclusively or primarily speakers of English. Northern Europe, where the Germanic languages are known to have originated, has in significant numbers speakers not only of English but also German, Dutch/Flemish, and Swedish/Danish/Norwegian.

By this criterion, India, home to only a single branch of the Indo-European language family (i.e. Indo-Aryan), is an exceedingly unlikely candidate for the Indo-European homeland; Central-Eastern Europe, on the other hand, is home to the Italic, Venetic, Illyrian, Germanic, Baltic, Slavic, Thracian, and Greek branches of Indo-European.
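
The center-of-gravity comparison above amounts to counting distinct branches per region and picking the region with the most. The Python sketch below illustrates that counting only; the function name `center_of_gravity` and the dictionary layout are assumptions made for this illustration, and the branch lists are the ones named in the preceding paragraph, not a complete inventory.

```python
# Branches of Indo-European listed for each region in the text above.
BRANCHES_BY_REGION = {
    "India": ["Indo-Aryan"],
    "Central-Eastern Europe": [
        "Italic", "Venetic", "Illyrian", "Germanic",
        "Baltic", "Slavic", "Thracian", "Greek",
    ],
}


def center_of_gravity(branches_by_region):
    """Return the region with the largest number of distinct branches."""
    return max(
        branches_by_region,
        key=lambda region: len(set(branches_by_region[region])),
    )


if __name__ == "__main__":
    for region, branches in BRANCHES_BY_REGION.items():
        print(region, len(set(branches)))
    print(center_of_gravity(BRANCHES_BY_REGION))
```

Counting branches rather than speakers is the point of the principle: a region with many speakers of one branch still scores low on diversity.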

Both mainstream Urheimat solutions locate the Indo-European homeland in the vicinity of the Black Sea.

Dialectal variation

It has long been recognized that a binary tree model cannot capture all linguistic alignments; certain areal features cut across language groups and are better explained through a model treating linguistic change like waves rippling out through a pond. This is true of the Indo-European languages as well. Various features originated and spread while Proto-Indo-European was still a dialect continuum. These features sometimes cut across sub-families: for instance, the instrumental, dative, and ablative plurals in Germanic, Baltic and Slavic feature endings beginning with -m-, rather than the usual -*bh-, e.g. Old Church Slavic instrumental plural synŭ-mi 'with sons', despite the fact that the Germanic languages are centum languages, while Baltic and Slavic are satem languages.

There is a close relationship between the dialectal relationships of the Indo-European languages and the actual geographical arrangement of the languages in their earliest attested forms, and this makes an Indian origin for the family unlikely. Given the geographic barriers separating the subcontinent from the rest of Eurasia, such as the Hindu Kush mountains, and the existence of the various Indo-European sub-families, an Indian Urheimat would require several successive staggered migrations (cf. Out of India theory). However, this would destroy the close correspondence between archaic shared linguistic features and geographical arrangement noted above. This arrangement is better explained by a radial expansion of the Indo-Europeans, a corollary of which is the migration of Indo-Aryan speakers into the subcontinent. In addition, there is a series of successive innovations, from Proto-Indo-European to Satem, Indo-Iranian, and Vedic Sanskrit, that makes the latter a late descendant that in some respects has even younger features than the closely related Iranian (such as the sede perfect, vs. Avestan hazde).

Indo-Iranians

Origin

The Indo-Iranians are commonly identified with the Andronovo culture, and their homeland with an area of the Eurasian steppe that borders the Ural River on the west, the Tian Shan on the east (where the Indo-Iranians took over the area occupied by the earlier Afanasevo culture), and Transoxiana and the Hindu Kush on the south. Historical linguists broadly estimate that a continuum of Indo-Iranian languages probably began to diverge by 2000 BCE, if not earlier,[1]:38–39 preceding both the Vedic and Iranian cultures. The earliest recorded forms of these languages, Vedic Sanskrit and Gathic Avestan, are remarkably similar, descended from the common Proto-Indo-Iranian language. The origin of the Nuristani languages and their earliest relationship to the Iranian and Indic groups is unrecoverably obscure.


The Indo-European language spoken by the Indo-Iranians in the late 3rd millennium BC was a Satem language still not removed very far from the Proto-Indo-European language, and in turn only removed by a few centuries from the Vedic Sanskrit of the Rigveda. The main phonological change separating Proto-Indo-Iranian from Proto-Indo-European is the collapse of the ablauting vowels *e, *o, *a into a single vowel, Proto-Indo-Iranian *a (but see Brugmann's law). Grassmann's law and Bartholomae's law were also complete in Proto-Indo-Iranian.

Indo-Aryan languages

The Indo-Aryan languages form a subgroup of the Indo-Iranian languages, which belong to the Indo-European family of languages. The term "Indic" refers to the same group without what some see as the negative connotations of "Aryan". Note that, unlike the generic adjective "Indian", "Indic" is the term used in the context of Indo-European linguistics, and is not strictly a geographical term; non-Indo-European languages spoken in India are not included in the term, while the Indo-Aryan superstrate in Mitanni, on the other hand, probably testifies to speakers of an Indic language that never settled on the Indian subcontinent.

SIL International, in a 2005 estimate, counted a total of 209 varieties, the largest in terms of native speakers being Hindustani (Hindi and Urdu, about 540 million), Bengali (about 200 million), Punjabi (about 100 million), Marathi (about 90 million), Gujarati (about 45 million), Nepali (about 40 million), Oriya (about 30 million), and Sindhi (about 20 million), with a total number of native speakers of more than 900 million.

History

The earliest evidence of the group is in Vedic Sanskrit, the language used in the ancient preserved texts of the Indian subcontinent, the foundational canon of Hinduism known as the Vedas. The Indo-Aryan superstrate in Mitanni is of similar age, but the only evidence is a number of loanwords.

In about the 7th century to the 5th century BCE, the Sanskrit language was codified and standardised by the grammarian Panini; this led (in about 200 BCE) to what is now known as "Classical" Sanskrit. However, although this preserved the integrity of the written language for a long time, the spoken language continued to evolve, and by the 6th century, Sanskrit as a spoken language was rare, being by and large replaced by its descendants, the Prakrits. All the Prakrits share a common ancestry, but they are not necessarily mutually intelligible.

In medieval times, the Prakrits diversified into various Middle Indic dialects. "Apabhramsa" is the conventional cover term for transitional dialects connecting late Middle Indic with early Modern Indic, spanning roughly the 6th to 13th centuries. Some of these dialects showed considerable literary production; the Sravakachar of Devasena (dated to the 930s) is now considered to be the first Hindi book.

The next major milestone occurred with the Muslim invasions of India in the 13th, 14th, 15th, and 16th centuries. Under the flourishing Mughal empire, Persian became very influential as the language of prestige of the Islamic courts. However, Persian was soon displaced by Urdu. This Indo-Aryan language combines Persian elements in its vocabulary with the grammar of the local dialects.

The two largest languages that formed from Apabhramsa were Bengali and Hindi; others include Gujarati, Oriya, Marathi, and Punjabi.

In the Hindi-speaking areas, the main form was Braj-bhasha, which is still spoken today, but was replaced in the 19th century by the Khari Boli dialect. However, a large amount of modern spoken Hindi vocabulary is derived from Perso-Arabic.

This state of affairs continued until the Partition of India in 1947. Hindustani (a mixture of Urdu and Hindi) was replaced by Hindi as the official language of India, and soon the Perso-Arabic words of Urdu began to be excised from the official Hindi corpus, in a bid to make the language more "Indian". A return to Hindi poets such as Tulsidas resulted in what is known as a Sanskritisation of the language. Persian words in common parlance were slowly replaced by Sanskrit words, sometimes borrowed wholesale, or in new compounds. In contemporary times, there is a continuum of Hindi–Urdu, with heavily-Persianised Urdu at one end and Sanskritised Hindi at the other, although the basic grammar remains identical. Most people speak somewhere in the middle: Hindustani.

Languages of Asia
There is a wide variety of languages spoken throughout Asia, comprising a number of families and some unrelated isolates. Many languages have a long tradition of writing.

Indo-European languages, widely spoken in Southern, Western and Central Asia as well as Russia:
Indo-Aryan languages: Sanskrit, Pali, Hindi, Urdu, Bengali, Nepali, Sinhalese, Assamese, Punjabi, Kashmiri, Sindhi, Oriya, Marathi, Bihari, Gujarati
Iranian languages: Persian, Kurdish, Pashto, Tajik, Baluchi
Germanic languages: English
Slavic languages: Russian
Sino-Tibetan languages:
Chinese language: Mandarin, Cantonese, Hakka
Tibeto-Burman languages: Tibetan, Burmese, Nepal Bhasa, Mizo
Semitic languages:
Arabic, Hebrew, Syriac, Aramaic
Dravidian languages:
Kannada, Malayalam, Tamil, Telugu, Tulu
Austronesian languages:
Atayal, Cebuano, Cham, Ilokano, Indonesian, Javanese, Malay, Paiwan, Sundanese, Tagalog, Tetum
Austro-Asiatic languages:
Vietnamese, Khmer, Mon, Khasi, Nicobarese, Munda
Turkic languages:
Turkish, Azeri, Turkmen, Tatar, Uzbek, Kazakh, Uyghur
Mongolic languages:
Mongolian, Buryat
Japonic languages:
Japanese, Ryukyuan
Tai-Kadai languages:
Thai, Hkamti, Lao
Uralic languages
Andamanese languages
Chukotko-Kamchatkan languages
Eskimo-Aleut languages
Yukaghir languages
The Buyeo languages are a hypothetical language family which includes Korean and the Japonic languages.
The Altaic languages are a somewhat disputed grouping including the Turkic, Mongolic, and Tungusic languages, with Korean and the Japonic languages included by some.
Another disputed language family, the Ural-Altaic languages, is formed with the addition of Uralic languages to the Altaic languages.
The Austric languages are a hypothesised language family grouping the Austronesian languages with the Austro-Asiatic languages.

A number of isolated languages - languages with no demonstrable links to other tongues - are also spoken in Asia:
Burushaski language
Nihali language
Nivkh