<div style="text-align:justify;"> <span style="font-family:Verdana;">Though many different readability metrics have been created, there still is no universal agreement defining readabil...<div style="text-align:justify;"> <span style="font-family:Verdana;">Though many different readability metrics have been created, there still is no universal agreement defining readability of software source code. The lack of a clear agreement of source code readability has ramifications in many areas of the software development life-cycle, not least of which being software maintainability. We propose a measurement based on Linguistic Economy to bridge the gap between mathematical and behavioral aspects. Linguistic Economy describes efficiencies of speech and is generally applied to natural languages. In our study, we create a large corpus of words that are likely to be found in a programmer’s vocabulary, and a corpus of existing identifiers found in a collection of open-source projects. We perform a usage analysis to create a database from both of these corpora. Linguistic Economy suggests that words requiring less effort to speak are used more often than words requiring more effort. This concept is applied to measure how difficult program identifiers are to understand by extracting them from the program source and comparing their usage to the database. Through this process, we can identify source code that programmers find difficult to review. We validate our work using data from a survey where programmers identified unpleasant to review source files. The results indicate that source files identified as unpleasant to review source code have more linguistically complicated identifiers than pleasant programs.</span> </div>展开更多
With the extensive application of corpus in foreign language teaching,data-driven English vocabulary teaching has opened up a broad prospect,especially in curriculum design,classroom teaching methods and language test...With the extensive application of corpus in foreign language teaching,data-driven English vocabulary teaching has opened up a broad prospect,especially in curriculum design,classroom teaching methods and language testing,etc.Corpus presents a large number of authentic language materials in the form of data and context,which helps to attract the attention of language learners and strengthen their memory.This paper mainly discusses the application of corpus in vocabulary teaching from the aspects of word frequency,collocation,semantic prosody and so on-taking synonyms‘quite’,‘rather’,‘fairly’as an example.展开更多
This study was concerned with the short vowels in modern standard Arabic words with Consonant Vowel-Consonant Vowel-Consonant Vowel (CVCVCV) structure, and the long vowels in words with Consonant Vowel Vowel-Consonant...This study was concerned with the short vowels in modern standard Arabic words with Consonant Vowel-Consonant Vowel-Consonant Vowel (CVCVCV) structure, and the long vowels in words with Consonant Vowel Vowel-Consonant (CVVC). Even though there has been a dispute on the precise number of Arabic vowels that exist between language studies, this study used the opinion that the Arabic language has three vowels;the elongation of each vowel gave the other three because this is the opinion of classical Arabic linguists which is the source of the Modern Standard Arabic (MSA). Studies said that the first and second formant values (F1, F2) can represent the vowels. In this study, the formants were measured using LPC (Linear Predictive Coding), verifying the measurement to see if the measured follows the pattern of formants measurements of the other studies, and the formants were used to investigate the relationship between short and long vowels. Furthermore, the study figured out if the dialect of speakers can affect the values of formants, even if the spoken language is MSA, some statistical measurements were calculated to evaluate the relationship.展开更多
Based on the Chinese-English parallel corpus of Yu Hua's novel To Live ( the first two sections), this paper focuses on analyzing Chinese reporting verb"shuo"and its translation through concordance, conc...Based on the Chinese-English parallel corpus of Yu Hua's novel To Live ( the first two sections), this paper focuses on analyzing Chinese reporting verb"shuo"and its translation through concordance, concordance plot, collocates, word list and other functions of AntConc. It aims to study translator Michael Berry's diction style and investigate how the source text has influenced the target text.展开更多
Under the framework of system-functional linguistics theory,this article researches some content of Rotterdam rules and Customs of the People's Republic of China.Through Antconc,a tool for concordance in corpus,th...Under the framework of system-functional linguistics theory,this article researches some content of Rotterdam rules and Customs of the People's Republic of China.Through Antconc,a tool for concordance in corpus,this work compares and analyzes two legal texts in terms of modality,thematic structure and information structure,and concludes the differences in linguistic structure between English and Chinese shipping conventions.On this basis,with the three principles of purpose,coherence and faithfulness in functional theory,this work finds the functional approaches and strategies of legal and customs transport translation so as to provide some new perspectives for maritime convention translation.展开更多
This paper takes Wuhan’s traffic signs as the research object,and collects the Chinese and English traffic signs in large quantities to build a small corpus.According to the text type theory proposed by German functi...This paper takes Wuhan’s traffic signs as the research object,and collects the Chinese and English traffic signs in large quantities to build a small corpus.According to the text type theory proposed by German functionalist school Katarina Rice,in this article the traffic signs are classified into three categories:information type text identifier,expression type text identifier and opera?tion type text identifier according to six functions including indication,prompt,restriction,compulsory,persuasion and publicity.It attempts to reveal the characteristics of Chinese signs and English translations of different text types,and to explore the translation and semantic rhyme of the word"forbidden"in the English translation of high-frequency vocabulary in traffic signs.It aims to pro?vide reference for the English translation of traffic signs,create a good language environment,shape a good city image,and increase the degree of China's internationalization.展开更多
The discrimination of synonyms has always been the key and difficult points in language learning.By using the British Na?tional Corpus(BNC)to retrieve the standard frequency,registers and collocations of the three com...The discrimination of synonyms has always been the key and difficult points in language learning.By using the British Na?tional Corpus(BNC)to retrieve the standard frequency,registers and collocations of the three common words,DESTORY,RUIN and DEMOLISH,this paper analyzes their similarities and differences.Besides,this paper demonstrates the strong ancillary func?tion of corpus in synonyms discrimination as well as in English teaching and learning.展开更多
文摘<div style="text-align:justify;"> <span style="font-family:Verdana;">Though many different readability metrics have been created, there still is no universal agreement defining readability of software source code. The lack of a clear agreement of source code readability has ramifications in many areas of the software development life-cycle, not least of which being software maintainability. We propose a measurement based on Linguistic Economy to bridge the gap between mathematical and behavioral aspects. Linguistic Economy describes efficiencies of speech and is generally applied to natural languages. In our study, we create a large corpus of words that are likely to be found in a programmer’s vocabulary, and a corpus of existing identifiers found in a collection of open-source projects. We perform a usage analysis to create a database from both of these corpora. Linguistic Economy suggests that words requiring less effort to speak are used more often than words requiring more effort. This concept is applied to measure how difficult program identifiers are to understand by extracting them from the program source and comparing their usage to the database. Through this process, we can identify source code that programmers find difficult to review. We validate our work using data from a survey where programmers identified unpleasant to review source files. The results indicate that source files identified as unpleasant to review source code have more linguistically complicated identifiers than pleasant programs.</span> </div>
文摘With the extensive application of corpus in foreign language teaching,data-driven English vocabulary teaching has opened up a broad prospect,especially in curriculum design,classroom teaching methods and language testing,etc.Corpus presents a large number of authentic language materials in the form of data and context,which helps to attract the attention of language learners and strengthen their memory.This paper mainly discusses the application of corpus in vocabulary teaching from the aspects of word frequency,collocation,semantic prosody and so on-taking synonyms‘quite’,‘rather’,‘fairly’as an example.
文摘This study was concerned with the short vowels in modern standard Arabic words with Consonant Vowel-Consonant Vowel-Consonant Vowel (CVCVCV) structure, and the long vowels in words with Consonant Vowel Vowel-Consonant (CVVC). Even though there has been a dispute on the precise number of Arabic vowels that exist between language studies, this study used the opinion that the Arabic language has three vowels;the elongation of each vowel gave the other three because this is the opinion of classical Arabic linguists which is the source of the Modern Standard Arabic (MSA). Studies said that the first and second formant values (F1, F2) can represent the vowels. In this study, the formants were measured using LPC (Linear Predictive Coding), verifying the measurement to see if the measured follows the pattern of formants measurements of the other studies, and the formants were used to investigate the relationship between short and long vowels. Furthermore, the study figured out if the dialect of speakers can affect the values of formants, even if the spoken language is MSA, some statistical measurements were calculated to evaluate the relationship.
文摘Based on the Chinese-English parallel corpus of Yu Hua's novel To Live ( the first two sections), this paper focuses on analyzing Chinese reporting verb"shuo"and its translation through concordance, concordance plot, collocates, word list and other functions of AntConc. It aims to study translator Michael Berry's diction style and investigate how the source text has influenced the target text.
文摘Under the framework of system-functional linguistics theory,this article researches some content of Rotterdam rules and Customs of the People's Republic of China.Through Antconc,a tool for concordance in corpus,this work compares and analyzes two legal texts in terms of modality,thematic structure and information structure,and concludes the differences in linguistic structure between English and Chinese shipping conventions.On this basis,with the three principles of purpose,coherence and faithfulness in functional theory,this work finds the functional approaches and strategies of legal and customs transport translation so as to provide some new perspectives for maritime convention translation.
文摘This paper takes Wuhan’s traffic signs as the research object,and collects the Chinese and English traffic signs in large quantities to build a small corpus.According to the text type theory proposed by German functionalist school Katarina Rice,in this article the traffic signs are classified into three categories:information type text identifier,expression type text identifier and opera?tion type text identifier according to six functions including indication,prompt,restriction,compulsory,persuasion and publicity.It attempts to reveal the characteristics of Chinese signs and English translations of different text types,and to explore the translation and semantic rhyme of the word"forbidden"in the English translation of high-frequency vocabulary in traffic signs.It aims to pro?vide reference for the English translation of traffic signs,create a good language environment,shape a good city image,and increase the degree of China's internationalization.
文摘The discrimination of synonyms has always been the key and difficult points in language learning.By using the British Na?tional Corpus(BNC)to retrieve the standard frequency,registers and collocations of the three common words,DESTORY,RUIN and DEMOLISH,this paper analyzes their similarities and differences.Besides,this paper demonstrates the strong ancillary func?tion of corpus in synonyms discrimination as well as in English teaching and learning.