An underlying period

A little while back we showed how it is possible to associate a beat with a particular point in the speech signal. Below you see what this beat assignment procedure looks like if applied to some continuous speech. In this case the subject was reading a prose passage, part of which is given at the bottom of the page. The undulating line indicates the energy content of the signal on which beat assignment is based and the vertical lines indicate beats.
many beats
Notice how irregular the beat intervals are. It is not an easy job to say if there is a rhythmic basis to this series of beats or not. However it can be done. The details are a little complex and are given in the thesis, but in essence we look at all interbeat intervals occurring over a good sized stretch of speech, say 30 sec worth, and do some statistics to see if we can identify some period, say p, which occurs frequently in the beat series, along with integer multiples of p. So if we find, for example, a tendency for intervals of 200, 400, 600, and 800 ms, this is a good indication that there is an underlying period here.

It is a little harder to say what the fundamental period is. For example, in this example it could be 400 ms, in which intervals of 200 ms would come about by insertion and intervals of 800 ms would arise from deletion. Both insertion and deletion are required to create intervals of 600 ms. Alternatively the basic period could be 200 ms, in which case all larger intervals arise by beat deletion. Or it could be 800 ms with a lot of beat insertion. You get the point. We can document the presence of a rhythm as a statistical tendency, without necessarily being able to write the musical score for the speech!

So what did you find?????

Basically, we found that when people read prose fluently there is often (but not always) evidence of an underlying period. Most interestingly, this tendency is much stronger when people are asked to read very quickly while still remaining intelligible. Some fast readings showed a surprisingly strong periodic component-comparable to that found in say a jazz recording.

Below you can see the beats and amplitude envelope from a small section of each of two readings. In each case the original text spoke here is " ...section, accenting the same beat". The top figure shows beats from the slower reading. Notice that the beats are quite irreguarly spaced. In the faster reading, the beats become much more regularly spaced. Listen to the two readings (the recordings provide a little more text than is shown in the figures).

One way in which the beat series has been regularized is by a stress shift. In the slower reading the subject says ...SEC-tion AC-cent-ing the SAME beat while in the faster reading she says ...SEC-tion ac-CENT-ing the SAME beat, with a stress on the second syllable of accenting. Although this would be a funny place to stress the word if spoken in isolation, it seems to be ok here as it helps to establish the regular beat period.

Beats from a slower reading

beats from a fast reading

Slower reading

slower reading (.au) slower reading (.wav)

Beats from a faster reading

beats from a fast reading

Faster reading

faster reading (.au) faster reading (.wav)

So what?

This second experiment showed that if we conceive of speech rhythm as a succession of beats, we can indeed find cases where the beats line up neatly, and we can distinguish rhythmical readings from less regular ones. Furthermore, it looks as if speech is more rhythmical when people speak in a rapid and fluent manner than when they are free to pepper their speech with pauses and hesitations. No big surprise perhaps, but it is reassuring to be able to document the fact in an experiment.

In the following page, the last of the present tour, the results of the two experiments presented here will be tied together and some speculation about their import for the study of action and coordination will be entertained.

Next page

The prose passage used in this experiment, from Beneath the Underdog, Charles Mingus, Vintage Books, 1971.
There once was a word used---swing. Swing went in one direction, it was linear, and everything had to be played with an obvious pulse and that's very restrictive. But I use the term `rotary perception.' If you get a mental picture of the beat existing within a circle you're more free to improvise. People used to think the notes had to fall on the center of the beats in the bar at intervals like a metronome, with three or four men in the rhythm section accenting the same pulse. That's like parade music or dance music. But imagine a circle surrounding each beat---each guy can play his notes anywhere in that circle and it gives him a feeling he has more space. The notes fall anywhere inside the circle but the original feeling for the beat isn't changed. If one in the group loses confidence, somebody hits the beat again. The pulse is inside you. (Mingus, 1971, p.~350--351)



This page was last updated 09/15/99
URL: http://www.cs.indiana.edu/rhythmsp/acih.html
Contact: Robert Port, port@indiana.edu
Copyright 1999, The Trustees of Indiana University