Tuesday, May 12, 2009

INDUS SCRIPT MAY BE A LANGUAGE

An ancient script that's defied generations of archaeologists has yielded some of its secrets to artificially intelligent computers.

Computational analysis of symbols used 4,000 years ago by a long-lost Indus Valley civilization (on the present-day border between Pakistan and India) suggests they represent a spoken language. Some linguists thought the symbols were merely pretty pictures. "The underlying grammatical structure seems similar to what's found in many languages," said University of Washington computer scientist Rajesh Rao.

The Indus script, used between 2600 and 1900 BCE in what is now eastern Pakistan and northwest India, belonged to a civilization as sophisticated as its Mesopotamian and Egyptian contemporaries. However, it left fewer linguistic remains. Archaeologists have uncovered about 1,500 unique inscriptions from fragments of pottery, tablets and seals. The longest inscription is just 27 signs long.

Rao, a machine learning specialist who read about the Indus script in high school and decided to apply his expertise to it while on sabbatical in India, may have solved the language-versus-symbol question, if not the script itself. "One of the main questions in machine learning is how to generalize rules from a limited amount of data," said Rao. "Even though we can't read it, we can look at the patterns and get the underlying grammatical structure."

Rao's team used pattern-analyzing software running what's known as a Markov model, a statistical tool that estimates how likely each element of a sequence is to follow the ones before it. When they seeded the program with fragments of Indus script, it returned grammatical rules based on patterns of symbol arrangement. These proved to be moderately ordered, just like spoken languages. As for the meaning of the script, the program remained silent.
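To make "moderately ordered" a little more concrete, here is a minimal Python sketch of one way a first-order Markov model can score how predictable a symbol sequence is. It is only an illustration, not the software or data Rao's team used: the function name conditional_entropy, the four-letter alphabet and the three toy corpora are all invented for this example. The score is the average uncertainty, in bits, about the next symbol given the current one. A rigidly ordered corpus scores zero, a random one scores close to the maximum, and a corpus with some but not total regularity lands in between.

```python
import math
import random
from collections import Counter, defaultdict


def conditional_entropy(sequences):
    """Bits of uncertainty about the next symbol given the previous one.

    Hypothetical helper: fits a first-order Markov model by counting
    adjacent symbol pairs, then computes H(next | previous).
    """
    followers = defaultdict(Counter)
    for seq in sequences:
        for prev, nxt in zip(seq, seq[1:]):
            followers[prev][nxt] += 1

    total_pairs = sum(sum(c.values()) for c in followers.values())
    entropy = 0.0
    for counts in followers.values():
        n_prev = sum(counts.values())
        for count in counts.values():
            p_joint = count / total_pairs  # P(previous, next)
            p_cond = count / n_prev        # P(next | previous)
            entropy -= p_joint * math.log2(p_cond)
    return entropy


# Toy corpora (invented, not Indus data).
alphabet = ["A", "B", "C", "D"]

# Rigid: every symbol has exactly one possible successor.
rigid_corpus = [["A", "B", "C", "D"]] * 50

# Mixed: each symbol has two equally likely successors.
mixed_corpus = [["A", "B", "C", "D"]] * 50 + [["A", "C", "B", "D"]] * 50

# Random: symbols drawn uniformly, with a fixed seed for repeatability.
rng = random.Random(0)
random_corpus = [[rng.choice(alphabet) for _ in range(10)] for _ in range(200)]

rigid_h = conditional_entropy(rigid_corpus)
mixed_h = conditional_entropy(mixed_corpus)
random_h = conditional_entropy(random_corpus)

print(f"rigid:  {rigid_h:.3f} bits")
print(f"mixed:  {mixed_h:.3f} bits")
print(f"random: {random_h:.3f} bits")
```

In this sketch the mixed corpus comes out at about one bit, between the rigid corpus at zero and the random corpus near the two-bit ceiling for a four-symbol alphabet. That intermediate range is what "moderately ordered" means here, though real inscriptions would need far more careful handling than these toy sequences.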

But according to Rao, this early analysis provides a foundation for a more comprehensive understanding of Indus script grammar, and ultimately its meaning. "The next step is to create a grammar from the data that we have," he said. "Then we can ask, is this grammar similar to those of the Sanskrit or Indo-European or Dravidian languages? This will give us a language to compare it to."
