1. Distance Functions
The pitch of a musical note (hereafter abbreviated as a note) is defined by its frequency measured in Herz (Hz), cycles per second.
The bigger the frequency, the higher the pitch.
The frequencies of all musical notes are the set of positive reals
. The interval between them is expressed by the ratio of their frequencies. As will be
shown below, in just intonation,
and
and the interval between them are calculated by division
. The reason for this is that use is
made of the logarithmic distance and the subtraction of logarithms is expressed as division of the anti-logs. In turn, the main reason why the logarithmic distance is used comes from the Weber-Fechner law (cf. e.g. [6] ) to the effect that stimuli are sensed logarithmically. For two notes a and b define
Then this is a distance function. For
,
and
Hence,
is a metric space. Thus, we think of intervals between two notes as the logarithmic difference, which we substitute by the quotient, say b/a, of corresponding frequencies
and say that the interval between a and b is b/a (by which we understand the logarithmic interval
).
2. Equivalence Classes
We introduce the relation
by
. (2.1)
Then this is an equivalence relation. For
, and if
, then
. Finally, if
and
, then
. Hence
is classified into equivalence classes:
.
This may be also viewed group-theoretically. The multiplicative group
or its subgroup
of all positive rationals contains the cyclic subgroup
. And the quotient group
may be considered as the set of all musical notes up to octaves.
Two musical notes whose frequencies are different by powers of 2 are thought of as belonging to the same equivalence class, or those which sound alike. Musicians express the equivalence relation by using the same label for those notes in the same class. E.g. if the middle C has frequency 260 Hz, then the note three octaves higher than middle C has frequency
Hz while the note two octaves below middle C has frequency
Hz. They are denoted by the same symbol C and are referred to high C or low C, respectively. The concert pitch which is used today is
. Then with this value, in Pythagoras scale, the middle
while in just intonation it is
, cf. also (6.1).
Definition 1. Two intervals which combine to give an octave is called an inversion to each other.
After the octave, the next simplest is the perfect fifth 3:2 containing 7 semitones whose inversion is the perfect fourth 4:3 containing 5 semitones. The major third 5:4 is the interval containing 4 semitones whose inversion is the minor sixth 8:5 which contains 8 semitones (Figure 1 & Figure 2).
The sequence of three notes arranged in the order of the major third and the minor third is called a major triad.
The common major chord (do-mi-so-do) has the ratio
, while the common minor chord (do-miç-so-do) has the ratio
(Figure 3).
Remark 2.1 For the meaning of “fifth’’, cf. Remark 5.1 below. There are three ratios with the prefix “perfect’’, which are perfect first (which is nothing but the unison), perfect fifth and perfect fourth. The major third is the interval from
to
, which is a representative of the equivalence class consisting of the intervals 5/4, 5/2, 5/1, 10/1,….
3. Pythagoras Law of Small Numbers and the Helmholtz-Joahim Scale
Pythagoras was the first who raised the question “Why is consonance associated with the ratios of small numbers?’’ Here “numbers’’ means integers. This is referred to as Pythagoras’ law of small numbers to the effect that only the small integer multiples of the fundamental notes can create harmony and consonance. There is a variation of this law, the law of cyclotomic numbers, cf. §7.
In this section as a partial explanation of Pythagoras’ law of small numbers, we refer to the experiment of Helmholtz and Joahim stated in [3] and we refer to
their result as the Helmhotz-Joahim scale. H. von Helmholtz with the help of the renowned violinist J. Joachim, made an experiment and tabulated the notes which are the most pleasing to the ears (Table 1).
Table 2 is the basis of Coxeter’s speculation of the law of cyclotomic numbers in §7.
4. Various Scales
When we pile up the notes starting from a basic note, we obtain an increasing sequence of pitches. The way of piling up is not at one’s disposal and one has to struggle to achieve good consonants among them. There are several sequences constructed and they are called a scale or sometimes pitch with the inventor’s name or with the describing term (sometimes also called an intonation).
In the case of Pythagorean scale (§5), it is formed using only powers of 2 and 3.
Since one whole-tone consists of two semi-tones and there are two semi-tones and five whole-tones in one octave, it follows that one octave has 12 semitones, cf. (8.6) below. Therefore, piling up the notes on the basic one, the 12th power is
very important. In the case of the Pythagorean scale, what are piled up are powers of
, so that
which is a little higher than 7 octaves:
. The interval
is known as the comma of Pythagoras. This discrepancy accounts for many difficulties in obtaining an organized system of pitches.
In the equal tempered system, one octave is equally divided into 12 semitones with ratio being the 12th root of unity
(4.1)
(a) (b)
Table 1. (a) Helmholtz-Joachim scale. (b) Helmholtz-Joachim scale (cont.).
5. The Pythagorean Scale
The Pythagorean scale is made of
,
,
. As is explained above, starting from middle C, A (la) is obtained as
.
From middle C (doh) by piling up
, we get G (sol), then
, which is D (re). The next is
, which is A (la) with frequency 440 Hz. The 5th is
, which is E (mi).
Definition 2. These 5 notes C, D, E, A, G form the pentatonic scale. It is the basis of many folk-songs in Scotland. Figure 4.
The 6th is
, which is B (si). Thus C (doh), G (sol), D (re), A (la), E (me), B (si). The 7th F (fa) is obtained by coming back from higher C. i.e.
.
This is the scale used by the Greeks and early medieval composers as the basis of the Ecclesiastic Mode. The scale is suitable for melodic writing but not satisfactory for harmonic writing (for modulation) (Figure 5).
We remark that the sequence
is very much relevant to number
theory, esp. in connection with the Waring problem, cf. e.g. [7] , where the Gaussian symbol will be defined after (5.1).
5.1. Major and Minor Chords
The home key could be any one of seven notes but what survived in tone-centered music nowadays are the Ionian scale beginning and ending on C and the Aeolian scale (with key-note A). The Ionian and Aeolian scales are known as the ordinary major and minor scales (Table 3).
Definition 3. The major triad is the piling up of the major third followed by the perfect fifth on the root. The minor triad is the piling up of the minor third followed by the perfect fifth on the root (Figure 6).
On [2] there are two speculations for the reason why those two chords remain. One is that the interval from si to do is very small and gives the impression that it is coming to an end. The other is more convincing that as one can see, the major triads are situated symmetrically over the octave, and so these chords survived. But although in the figure it is apparent, it is not certain whether one senses this symmetry by ears. We guess that this is not a reasoning by senses but by reasons. On [2] , it is stated that Bach’s music is quite mathematical but for Bach, music comes first and the accompanying mathematical structure is rather the by-products of composing music―as mathematics of senses. [8] [9] [10] [11] etc. on Bach’s music and its plausible relation with mathematics.
5.2. Arithmetic of the Pythagorean Scale
In §5, the piling up method is given of constructing Pythagorean scale, but each time one has to think which power p of
is to be multiplied to shift the note into the interval
. In this section we show the following theorem which gives the value of the exponent p of 2 uniquely, given q of the exponent of 3. Here we extend the range of q to
. We appeal to the numerical values
Table 3. The Ionian and Aeolian scales are known as the ordinary major and minor scales.
where the logarithm means the natural logarithm.
Theorem 5.1 1) The unique integer p satisfying the conditions
(5.1)
is
, where the symbol
indicates the integral part (the Gauss symbol) of the real number x.
2) For an integer p to exist that satisfies
(5.2)
the inequality
(5.3)
must hold. Cf. Table 4 & Table 5.
Proof. The restriction (5.1) amounts to
and the interval
has length exactly 1, so that it contains a unique integer, which is
. (5.3) follows similarly. □
Remark 5.1. 1) We note that the value
for Fis invalidates the law of small numbers in §3. In order to overcome this difficulty, we choose the fraction x such that
, i.e.
. Then this awkward number appears as the distance between B and F:
. Also the fractions
,
,
appear as the difference between D and (high) C, F and D, and E and C.
2) From Figure 5.1 we see that there are 5 tones under G including G itself: C, D, E, F, G, thus justifying the name “5th’’ in Definition 1. 2) The semi-tone
and # in Definition 4 are called limma and apotome, respectively.
Table 4. Values associated to Theorem 5.1.
Table 5. Frequency ratios in Pythagorean scale.
(5.4)
6. The Verdi Pitch
The Verdi pitch is the scale which fixes the freq. of A to be 432. In [6] the following theorem has been found.
Theorem 6.1 The reason for the
is that in the Pythagorean scale,
(6.1)
This makes some of the calculations simpler but this was not used much. The reason is that the instruments are tuned not from C but from A and in just
intonation it gives a fractional frequency to A:
. This
is approximately the standard pitch used in the time of Beethoven (Figure 7).
7. The Just Intonation and the Law of Cyclotomic Numbers
The Pythagorean major 3rd
is slightly bigger than the major 3rd
of
just intonation. The Pythagorean major 3rd is said to let the melody sound beautifully, but it diminishes harmony because of the beats contained. In
the number of beats caused by the 4 times and 5 times of m is
which is 16.5 times/s for
Hz, say. Hence in the Pythagorean major triad
there occur beats between the root and the major 3rd. To eliminate this beat, we decrease the major third by multiplying by the syntonic comma
to make it
. This yields the just intonation, in which the major 3rd consists of
Figure 8 and Table 6.
On the basis of Table 2, Coxeter [3] states a speculation that the agreeable harmonics
(7.1)
correspond to the number of sides of regular polygons that can be constructed by a ruler and a compass. A natural inverse question is where there are corresponding harmonics to 15 and 17 since 15-gon was constructed by Euclid while 17-gon was by Gauss in 1797. The interval 15 from a low C to high B which thrilled the audience appears as the appoggiatura in the end of St. Matthew Passion (Figure 9). The interval 17 were used by the remaining two of the 3 B’s, i.e. by Beethoven and Brahms. Since regular polygons inscribed in a circle divides it into equal parts, cyclotomy, Coxeter refers to the law of cyclotomic numbers rather than the law of small numbers.
Figure 9. Appoggiatura at the end of St. Matthew Passion.
Table 6. Frequency ratios in just intonation scale.
8. Modulation
Part of the contents of this section is an elaboration of material in [4] . In this section we use the notation
, which are the initials in tonic sol-fa notation, doh, ray, me fah, soh, lah, te, doh. We also think of this as an increasing sequence of frequencies denoted by the corresponding symbols. We may conveniently write
(8.1)
to express the ratios of each note against the basic d. We consider the modulation from C to G. In order to express the correspondence, we expand (8.1) to obtain
(8.2)
To modulate C major into G major is just to replace the basic note d in (8.2) by s by which we obtain the sequence, thereby adding the prime to indicate the correspondence
(8.3)
where we used the relation
. In general by comparing the frequencies, we obtain
(8.4)
Also comparing the ratios in (8.2) and (8.3), we find that the ratios with prime are equal to those without:
(8.5)
for all
, e.g.
, the far-right side member we denote by q.
Then (8.5) amounts to
With d and q as parameters, we may express all other notes in terms of them. First
. Using (8.4) and (8.5), we have
, whence
. Similarly, from
we have
. Since
we deduce that
. From
we deduce that
. From
, it follows that
. Finally, from
, it follows that
.
Theorem 8.1 The major chord in Table 3 is expressed as
while the minor chord as
Corollary 8.1 The Pythagorean scale is the case with
.
Remark 8.1 Although Theorem 8.1 explains the reason why f is chosen as
,
it is a typical hindsight since Pythagoras had no concept of modulation and he must have followed the reasoning given in Remark 5.1, 1) based on the law of small numbers.
Corollary 8.2 There are three major triads,
,
,
.
Proof. There are three major thirds
and so the piling up of minor thirds are possible only for those starting with
. □
Definition 4 The major 2nd is usually called tome and denoted by w and the minor 2nd is called semi-tome denoted
:
1 octave consists of 5 tones and 2 semi-tones:
(8.6)
in conformity with the statement in §4. Table 7.
9. Equal Temperament
The law of cyclotomic numbers work well within the same key, but produces a contradiction if we modulate into other keys. In Table 6, the whole tone from Eç
to F or from G to A is
while for the tone from F to G is
and they are different by
, which the Greeks called a comma. To overcome
this difficulty, the well-tempered scale was invented shortly before the time of Bach (Figure 10). Bach composed “Well-tempered Clavier’’. a masterpiece
Table 7. Full range of musical intervals.
collection of 48 preludes and fugues. In this system every semitone is exactly the 12th root of 2 as given in (4.1).
Theorem 9.1 The infinite group of tempered intervals
modulo octaves is isomorphic to the cyclic group
, i.e.
The proof follows from the isomorphism theorem on noting that the kernel of the homomorphism
is
.
Example 1 As defined in §1, the major third consists of 4 semi-tones, and so in equal tempered system it is
. The interval from
to
being
, it is the major third corresponding to 5/4. It is a representative of the equivalence class consisting of the intervals
10. Bach’s Fifth
Recall Corollary 8.2. We consider Kellner’s tuning, cf. e.g. [11] . In the major
triad
, we make q a little smaller than natural
5th so that the number of beats caused by three times the root and twice the 5th note be equal to that of the beat caused by the four times the the 3rd and the 5th times of the root, i.e. writing such a q by qB, we have
whence we are led to the Quartic Equation
(10.1)
called Bach’s equation. Numerically
(10.2)
Solution of Quintic Equation
t is well-known that equations with rational (or algebraic) coefficients of degree ≤4 are soluble algebraically, i.e. by ordinary arithmetic operations plus extraction of roots. For our curiosity we shall find the (real) roots of a quintic equation
(10.3)
with
, i.e. (15). This can be solved using its resolvent equation of degree 3:
which is in the present case
(10.4)
with discriminant
Since
, it follows that the Galois group of (10.3),
, is the 4th symmetric group
. There exists an explicit formulas for solutions due to Cardano but it is too cumbersome to state. Here we adopt the method of Ohtuska [4] , which is indeed Cardano’s method. To solve (10.4) one puts
(10.5)
Then (10.4) becomes
whence
. On the other hand, (10.5) is easily soluble:
and we choose
. At the same time,
, so that
.
Now recall how the resolvent Equation (10.4) was derived. With y a new variable and a a parameter, we add
to both sides of
with
to obtain
(10.6)
provided that
or
(10.7)
For this to coincide with the resolvent Equation (10.4) we choose
and write
for y. Hence (10.6) amounts to
(10.8)
whence
(10.9)
which gives the value (10.2) for Bach’s fifth.
11. Music as Duende
11.1. Art vs. Technique
A singer is said to be non-standard original if he or she sings in his or her own original way, not necessarily following exactly the musical notes. The same applies to machine music.
The non-standard singer sings a song in his or her original way so that the song is much more appealing to the minds than scholarly way of singing according exactly to the musical notes or machine music.
Thus the music heavily depends on the talent of performers and those non-standard original singers or players are those who are talented enough to grasp the message and intension of the composer to convey it to the audience by expressing it by their “duende’’.
This class of singers include Hibari Misora, Hideaki Tokunaga, Yumi Matsutoya, or more recent ones, Hikaru Utada, Lia., et al.
It can be speculated that we feel comfortable when we receive stimuli whose frequency are the same as ours, i.e. 1/f-noise. This includes the case of music. If it shows 1/f-noise, then it gives us comfort.
11.2. HiFi vs. WiFi
In ( [12] , pp. 30-33) the principle of CD is stated. The pitch of the sound can be divided into
parts because a CD can record 16 convex-concave as one information and transforms into 0 and 1 signal. The CD reads these 16 information 44,100 times per second. The reason for this depends on the assumption that human ears can hear the sound whose frequencies are up to
. Since the sound with frequency 20,000 Hz oscillates 20,000 times per second and so more than this times of sampling is needed. And for stereo recording, we need twice as many, whence the sampling frequency 44,100.
The assumption that all the frequencies higher than 20,000 Hz may be neglected is rather controversial. For recording simple conversation may not need more, but as supreme art, this omission can be a serious problem because what is missing is often more meaningful as art. We recall that as soon as the real sound is transformed into digital signals, it is not the real signal but an approximation.
Not only in music but flower arrangement, it is said that what is important is rather the space surrounding the arranged flowers. We recall a poem of R. Kinoshita
Peony flowers
So stable and in full bloom
The solidness of the position
The flowers occupy.
12. Fluctuations
On fluctuations, there is a bunch of literature. We mention only a few [13] [14] [15] [16] [17] . Let
be a signal and consider its discrete Fourier transform (DFT)
(12.1)
for the frequency f. Let the power spectrum be defined by
(12.2)
For a more general treatment of finite power signals, we refer to [1] .
It was found (e.g. [15] ) that many natural phenomena possess the power spectrum depending on the frequency
. This kind of phenomena is called 1/f-noise or fluctuation.
In [17] , the distribution of primes is considered from the point of view of 1/f-fluctuation. However, in [18] , there is criticism against the interpretation of power spectrum, saying that it applies to non-living.
We are to study music as something that lies at this threshold of living and non-living.