托福阅读素材之埃及罗塞塔石碑(三)


时间:2017/5/21
作者:辛达托福代报名小编
-返回首页 / 返回文章列表


 辛达托福代报考位:到2016年5月21日上午托福官网没有释放考位,在此提醒广大考生抓紧时间报名,注意自己的考试时间,如对雅思报名有任何疑问,欢迎咨询在线客服

  7:34

  What other properties of language does the script show? Languages contain patterns. If I give you the letter Q and ask you to predict the next letter, what do you think that would be? Most of you said U, which is right. Now if I asked you to predict one more letter, what do you think that would be? Now there's several thoughts. There's E. It could be I. It could be A, but certainly not B, C or D, right? The Indus script also exhibits similar kinds of patterns. There's a lot of text that start with this diamond-shaped symbol. And this in turn tends to be followed by this quotation marks-like symbol. And this is very similar to a Q and U example. This symbol can in turn be followed by these fish-like symbols and some other signs, but never by these other signs at the bottom. And furthermore, there's some signsthat really prefer the end of texts, such as this jar-shaped sign, and this sign, in fact, happens to be the most frequently occurring sign in the script.

  8:24

  Given such patterns, here was our idea. The idea was to use a computer to learn these patterns, and so we gave the computer the existing texts. And the computer learned a statistical model of which symbols tend to occur together and which symbols tend to follow each other. Given the computer model, we can test the model by essentially quizzing it. So we could deliberately erase some symbols,and we can ask it to predict the missing symbols. Here are some examples. You may regard this as perhaps the most ancient game of Wheel of Fortune.

  9:04

  What we found was that the computer was successful in 75 percent of the cases in predicting the correct symbol. In the rest of the cases, typically the second best guess or third best guess was the right answer. There's also practical use for this particular procedure. There's a lot of these texts that are damaged. Here's an example of one such text. And we can use the computer model now to try to complete this text and make a best guess prediction. Here's an example of a symbol that was predicted. And this could be really useful as we try to decipher the script by generating more data that we can analyze.

  9:36

  Now here's one other thing you can do with the computer model. So imagine a monkey sitting at a keyboard. I think you might get a random jumble of letters that looks like this. Such a random jumble of letters is said to have a very high entropy. This is a physics and information theory term. But just imagine it's a really random jumble of letters. How many of you have ever spilled coffee on a keyboard?You might have encountered the stuck-key problem -- so basically the same symbol being repeated over and over again. This kind of a sequence is said to have a very low entropy because there's no variation at all. Language, on the other hand, has an intermediate level of entropy; it's neither too rigid,nor is it too random. What about the Indus script? Here's a graph that plots the entropies of a whole bunch of sequences. At the very top you find the uniformly random sequence, which is a random jumble of letters -- and interestingly, we also find the DNA sequence from the human genome and instrumental music. And both of these are very, very flexible, which is why you find them in the very high range. At the lower end of the scale, you find a rigid sequence, a sequence of all A's, and you also find a computer program, in this case in the language Fortran, which obeys really strict rules. Linguistic scripts occupy the middle range.

  10:49

  Now what about the Indus script? We found that the Indus script actually falls within the range of the linguistic scripts. When this result was first published, it was highly controversial. There were people who raised a hue and cry, and these people were the ones who believed that the Indus script does not represent language. I even started to get some hate mail. My students said that I should really seriously consider getting some protection. Who'd have thought that deciphering could be a dangerous profession? What does this result really show? It shows that the Indus script shares an important property of language. So, as the old saying goes, if it looks like a linguistic script and it acts like a linguistic script, then perhaps we may have a linguistic script on our hands. What other evidence is there that the script could actually encode language?

  11:38

  Well linguistic scripts can actually encode multiple languages. So for example, here's the same sentence written in English and the same sentence written in Dutch using the same letters of the alphabet. If you don't know Dutch and you only know English and I give you some words in Dutch,you'll tell me that these words contain some very unusual patterns. Some things are not right, and you'll say these words are probably not English words. The same thing happens in the case of the Indus script. The computer found several texts -- two of them are shown here -- that have very unusual patterns. So for example the first text: there's a doubling of this jar-shaped sign. This sign is the most frequently-occurring sign in the Indus script, and it's only in this text that it occurs as a doubling pair.

大家在进行托福备考的同时,如果对考位方面也比较紧张,可联系辛达代报为您服务



☆转载声明: 各位同行和网友们,欢迎转载或引用在本站的文章,敬请标注原文出自辛达托福代报网!

其他文章推荐

词汇量对阅读分数有影响吗

影响托福阅读能力的关键

托福写作和雅思写作的区别

30分钟内写出高质量托福作文

近义掌握托福词汇

辛达代报名网站编辑部



上一篇:托福阅读素材之埃及罗塞塔石碑(二)

下一篇:托福阅读素材之埃及罗塞塔石碑(四)