On the Teaching of the Theory of Vectors Spaces in First Year of French Science University

Jean-Luc Dorier



Abstract:

In the French tradition of Bourbaki, the theory of vector spaces is usually presented in a very formal setting, which causes severe difficulties to many students. The aim of this paper is to analyze the underlying reasons of these difficulties and to suggest some ways to make the first teaching of the theory of vector spaces less inefficient for many students. We do not reject the necessity for formalism. On the contrary, on the basis of a historical analysis we can explain the specific meaning it has in the theory. From this mathematical analysis with a historical perspective, we analyze the teaching and the apprehension of vector space theory in a new approach. For instance, we will show that mistakes made by many students can be interpreted as a result of a lack of connection between the new formal concepts and their conceptions previously acquired in more restricted, but more intuitively based areas. Our conclusions will not plead for avoiding formalism but for a better positioning of the formal concepts with regard to previous knowledge of the students as well as special care to be given in making the role and the meaning of formalism in linear algebra explicit to the students.


Introduction

In collaboration with A. Robert, J.Robinet, and M. Rogalski, we have developed a research program on the learning and teaching of linear algebra in the first year of French science universities. This work, which started some ten years ago, includes the elaboration and evaluation of experimental teaching based on a substantial historical study and a theoretical approach within the French context of "didactique des mathmeatiques"(the French name for a field of research which would more or less correspond to mathematical education research in the USA). In this paper, we will try to summarize the main issues of our research, focusing the concepts of linear dependence and independence.

If a system of linear equations has as many equations as unknowns (n), dependence between the equations of the system may be understood in different ways. If one is not familiar with the notion of linear dependence, but more concerned with solving the system, the dependence reflects an indetermination on the solutions of the system. Practically, it means that, in the process of resolution, one (or more) unknown(s) will be left undetermined. Therefore n dependent equations in n unknowns will be characterized by the fact that they determine unknowns less than n and thus act as if they were less than n. With regard to the solving of equations, dependence is therefore revealed by an accident in the solving these results in the vanishing of at least on equation and the indetermination of at least one of the unknowns. It is an accident because n equations usually determine n unknowns exactly. If the method for solving the system uses linear combinations, this accident may be connected to the fact that a linear combination of the equations is zero. If the dependence is "obvious", one may even see directly that one equation is a linear combination of the others, although this will not be the central characteristic of the dependence.

Historical background

Although it might be difficult to admit for a modern mathematician so familiar with the vocabulary and basic notions of linear algebra, the above way of considering dependence between equations may be found (with the same words) in a text by Euler dating from 1750. It still prevailed in most of the texts about linear equations up to the end of the 19th century (see Dorier 1993, 1995a, 1996b).Euler's text is the first in which the question of dependence was discussed. The general idea that n equations determine n unknowns was so strong that nobody had taken the pain to discuss the exceptional case, until Euler was confronted with Cramer's paradox and pointed out this particularity.

He starts by an example with two equations:

Let us just look at these two equations

3x-2y = 5 and 4y = 6x -10, one will see immediately that it is not possible to determine the two unknowns x and y, as if one eliminates x , then the other unknown y disappears by itself and one gets an identical equation, from which it is not possible to determine anything. The reason for this accident is quite obvious as the second equation can be changed into 6x -4y = 10, which being simply the first one doubled, is thus not different.

It is clear -especially by reading the end of this quotation- that Euler does not intend to fool his reader, even though he artificially hides the similarity of the two equations. Yet, it is also clear that it is not the fact that the two equations are similar that determines the dependence of the equations, but the fact that something unusual -an accident- happens in the final step of the solving process. This accident reveals the dependence of the equations, because, although there are two of them, these equations do not determine two unknowns. Mathematically speaking, the two statements are logically connected, a linear dependence between n equations in n unknowns is equivalent to the fact that the system will not have a unique solution; However the two properties correspond to two different conceptions of dependence. To be able to distinguish these two conceptions, I will call Euler's conception, inclusive dependence. I wish to insist on the fact that this conception is natural in the context in which Euler and all the mathematicians of his time were working, that is to say with regard to the solving of linear equations, and not the study of equations as objects on their own.

Let us now see what Euler says for three equations:

[…]The first one, being not different from the third one, does not contribute at all in the determination of the three unknowns.

But there is also the case when one of the three equations is contained in the two others. […] So when it is said that to determine three unknowns, it is sufficient to have three equations, it is necessary to add the restriction that these three equations are so different that none of them is already comprised in the others.

It is important to notice that, for three equations, Euler separates the case when two equations are equal from the case when the three equations are globally dependent. This points out the intrinsic difficulty of the concept of dependence which has to take all the equations in a whole system into account, and not only the relations in pairs. We will see that students have real difficulties with this point. On the other hand, Euler's use of terms such as comprised or contained, refers to the conception of inclusive dependence as we explained above. It does not mean that Euler was not aware of the logical equivalence with linear dependence, but, within his practice with linear equations, the conception of inclusive dependence is more consistent and efficient. Yet, there is a difficulty for further development; indeed, the conception of inclusive dependence is limited to the context of equations and cannot be applied to other objects like n-tuples for instance. Therefore inclusive dependence is context-dependent although linear dependence is a general concept that applies to any object of a linear structure. Yet, in his text Euler was able to bring out issues that can be considered in many aspects as the first consistent ideas on the concept of rank. Indeed, he discusses, although in a very intuitive and vague manner, the relation between the size of the set of solutions and the number of relations of dependence between equations. We will see now that it took more than a century for the concept of rank to come to maturity.

1750 is also the year Cramer published the treatise that introduced the use of determinants which was to dominate the study of linear equations until the first quarter of the 20th century. In this context, dependence was characterized by the vanishing of the determinant. The notion of linear dependence, now basic in modern linear algebra, did not appear in its modern form until 1875. Frobenius introduced it, pointing out the similarity with the same notion for n-tuples. He was therefore able to consider linear equations and n-tuples as identical objects with regard to linearity. This simple fact may not seem very relevant but it happened to be one of the main steps toward a complete understanding of the concept of rank. Indeed in the same text, Frobenius was able not only to define what we would call a basis of solutions but he also associated a system of equations to such a basis (each n-tuple is transformed into an equation). Then he showed that any basis of solutions of this associated system has an associated system with the same set of solutions as the initial system. This first result on duality infinite-dimensional vector spaces showed the double level of invariance connected to rank both for the system and for the set of solutions. Moreover, Frobenius' approach allowed a system to be seen as an element of a class of equivalent systems having the same set of solutions: a fundamental step toward the representation of sub-spaces by equations.

This brief summary of over a century of history shows how adopting a formal definition (here of linear dependence and independence) may be a fundamental step in the construction of a theory, and is therefore an essential intrinsic constituent of this theory.

Didactical issues

Anyone who has taught a basic course in linear algebra knows how difficult it may be for a student to understand the formal definition of linear independence, and to apply it to various contexts. Moreover, once students have proven their ability to check whether a set of n-tuples, equations, polynomials or functions are independent, they still may not be able to use the concept of linear independence in more formal contexts.

A. Robert and J. Robinet (1989) have tested beginners on the following questions:

1. Let U, V and W be three vectors in IR3. If any pair of them is non-collinear, are they independent?

2.1. Let U, V and W be three vectors in IR3, and f a linear operator in IR3. If U, V and Ware independent, are f(U), f(V) and f(W) independent ?

2.2. Let U, V and W be three vectors in IR3, and f a linear operator in IR3. If f(U), f(V)and f(W) are independent, are U, V and W independent ?

Beginners generally failed these questions. In the three cases, they used the formal definition of linear independence and tried different combinations with the hypotheses and the conclusions leading to apparently erratic proofs, that teachers usually reject without further comment.

For instance, to the first question a majority of students answered "yes" giving proofs that are examples of the difficulty they have in treating linear (in)dependence globally. Indeed, many students are inclined to treat the question of linear (in)dependence by successive steps, starting with two vectors, and then introducing the others one by one much like Euler did. We will say that they have a local approach to a global question. Indeed, in many cases, at least if it is well controlled, this approach may be correct and actually quite efficient, yet, it is a source of mistakes in several situations. The students have built themselves what G. Vergnaud (1990) calls theoremes-en-acte (i.e. rules of action or theorems that are valid in some restricted situations but create mistakes when abusively generalized to more general cases). Here is a non-exhaustive list of theoremes-en-acte connected with the local approach of linear (in)dependence, that we have noticed in students' activities:

- if U and V are independent of W, then U, V and W are globally independent

- if U1 is not a linear combination of U2,U3,…, Uk , then U1,U2,…, Uk are independent

- if U1, V1 and V2 are independent and if U2, V1 and V2 are independent, U1,U2, V1 and V2 are independent.

The historical analysis confirms the fact that there is a difficulty in treating the concept of linear (in)dependence as a global property(remember the distinction made by Euler for three equations). It follows that special care must be taken in the teaching regarding this point. For instance the exercise above can be discussed with the students. Moreover, the teacher, knowing the type of th_or_mes-en-acte, that students may have built, must help them in understanding their mistakes and thereby correct them more efficiently.

To questions 2.1 and 2.2 above, many students answered respectively "yes" and "no", despite coming close to writing the correct proof for the correct answers. Here is a reconstructed proof that reflects the difficulties of the students:

If aU + bV +gW = 0 then f(aU + bV + gW) = 0

so f being a linear operator: af(U) + bf(V) +gf(W) = 0 ,

now as U, V and W are independent, then a =b=g =0,

so f(U), f(V) and f(W) are independent.

In their initial analysis, A. Robert and J. Robinet concluded that this type of answer was revealing a bad use of mathematical implication as characterized by the confusion between hypothesis and conclusion. This is indeed a serious difficulty in the use of the formal definition of linear independence. In the following year, we tested the validity of this hypothesis with different students. Before the course, we set up a test to evaluate the students' ability in elementary logic and particularly in the use of the mathematical implication (Dorier 1990a and b), and after the course, we gave the same exercise as above to the students. The results showed that the correlation was insignificant, in some cases it was even negative. Yet, on the whole (both tests included many questions), there was quite a good correlation between the two tests. This shows that if a certain level of ability in logic is necessary to understand the formalism of the theory of vector spaces, general knowledge, rather than specific competence is needed. Furthermore, if some difficulties in linear algebra are due to formalism, they are specific to linear algebra and have to be overcome essentially in this context.

On the other hand, some teachers may argue that, in general, students have many difficulties with proof and rigor. Several experiments that we have made with students showed that if they have connected the formal concepts with more intuitive concepts, then they are in fact able to build very rigorous proofs. In the case of the preceding exercise for instance, after the test, if you ask the students to illustrate the result with an example in geometry, they usually realize very quickly that there is something wrong. It does not mean that they are able to correct their wrong statement, but they know it is not correct. Therefore one main issue in the teaching of linear algebra is to give our students better ways of connecting the formal objects of the theory with their previous conceptions, in order to have a better intuitively based learning. This implies not only giving examples but also to show how all these examples are connected and what the role of the formal concepts is with regard to the mathematical activity involved.

For instance, R. Ousman (1996) gave a test to students in their final year of the lycee (just before entering university). Through this test, he wanted to analyze the conception of students on dependence in the context of linear equations and in geometry before the teaching of the theory of vector spaces. He gave several examples of systems of linear equations and asked the students whether the equations were independent or not. Of course he noticed mistakes due to a local approach but the answers showed also that the students justify their answer through the solving of the system. Therefore they very rarely give a justification in terms of linear combinations but most of the time in terms of equations vanishing or unknowns remaining undetermined. Their concept of (in)dependence is, like Euler's, that of inclusive dependence and not linear dependence. Yet, this is not surprising, as these students, like Euler and the mathematicians of his time, are only concerned with solving a linear system, therefore inclusive dependence is more natural and more relevant for them.

However, the formal concept is the only means to comprehend all the different types of "vectors" in the same uniform manner, as subject to linear combinations. In other words, students must be aware of the unifying and generalizing nature of the formal concept. In our research, we used what we called the meta lever. Therefore we built teaching situations leading students to reflect on the nature of the concepts with explicit reference to their previous knowledge (Dorier 1991,1992, 1995b and 1997 and Dorier et al. 1994a and b). In this approach, the historical analysis is a source of inspiration as well as a means of control. Nevertheless, these activities must not only involve a lecture by the teacher, nor a reconstruction of the historical development, but take into account the specific constraints of the teaching context, to reconstruct an evolution of the concepts with consistent meaning.

For instance, with regard to linear (in)dependence, French students entering university normally have a good practice of Gaussian elimination for solving systems of linear equations. It is therefore possible to begin teaching linear algebra by making them reflect on this technique not only as a tool but also as a means to investigate the properties of the systems of linear equations. This does not conform to the historical development, as the study of linear equations was historically mostly held within the theory of determinants. Yet, Gaussian elimination is a much less technical tool and a better way for showing the connection between inclusive dependence and linear dependence as identical equations (in the case when the equations are dependent) are obtained by successive linear combinations of the initial equations. Moreover, this is a context in which such question as "what is the relation between the size of the set of solutions of a homogeneous system and the number of relations of dependence between the equations?" can be investigated with the students as a first intuitive approach for the concept of rank. M. Rogalski has experimented with teaching sequences illustrating these ideas (Rogalski 1991, Dorier et al. 1994a and b and Dorier 1992 and 1997).

Finally we give the scheme of a teaching experiment that we have set up for the final step in the teaching when introducing the formal theory after having made as many connections as possible with previous knowledge and conceptions in order to build better intuitive foundations.

After the definitions of vector space, sub-space and linear combination, the notion of generator is defined. A set of generators gathers all the information we have on the sub-space, it is therefore interesting to reduce it to the minimum. The question thus is to know when it is possible to take away one generator, with the remaining vectors still being generators for the whole sub-space. The students easily find that the necessary and sufficient condition is that the vector that can be taken away must be a linear combination of the others. This provides the definition of linear dependence: "a vector is linearly dependent on others if and only if it is a linear combination of them". This definition is very intuitive, yet it is not completely formal, and it needs to be specified for sets of one vector. Without difficulty it induces the definition of a set of independent vectors as a set in which no vector is a linear combination of the others. To feel the need for a more formal definition, one just has to reach the application of this definition. Indeed, students must answer the question: "are these vectors independent or not?". With the definition above, they need to check that each vector, one after the other, is a linear combination of the others. After a few examples, with at least three vectors, it is easy to explain to the students that it would be better to have a definition in which all the vectors play the same role. One is now ready to transform the definition of linear dependence into: "vectors are linearly independent if and only if there exists a zero linear combination of them, whose coefficients are not all zero." The definition of linear independence being the negation of this, it is therefore a pure problem of logic to reach the formal definition of linear independence. A pure problem of logic, but in a precise context, where the concepts make sense to the students from their intuitive background.

The evaluation of our research proved that students having followed an experimental teaching based on this approach are more efficient in the use of the definitions of linear dependence and independence, even in formal contexts. Their scores for exercises such as the three questions quoted above are much higher than the scores of students having followed a more classical teaching (Dorier 1997).

Moreover, it is quite a discovery for the student to realize that a formal definition may be more practical than an "intuitive" one. Indeed, most of them keep seeing the fact that a vector is a linear combination of the others as a consequence of the definition of linear dependence. Therefore they believe that this consequence is the practical way of proving that vectors are or are not independent, even if that goes contrary to their use of these definitions.

This example is relevant with regard to the question about the role of formalism in linear algebra. Formalism is what students themselves confess to fear most in the theory of vector spaces. One solution is to avoid formalism as far as possible, or at least to make it appear as a final stage gradually. Because, from our historical analysis, we have pointed out evidence that formalism is essential in this theory, we give a different answer: formalism must be put forward in relation to intuitive approaches as the means of understanding the fundamental role of unification and generalization of the theory. This has to be an explicit goal of teaching. This is not incompatible with a gradual approach toward formalism, but it induces a different way of thinking out the previous stages. Formalism is not only the final stage in a gradual process in which objects become more and more general, it must appear as the only means of comprehending different aspects within the same language. The difficulty here is to give a functional aspect to formalism while approaching it more intuitively.

Linear dependence is a formal notion that unifies different types of dependencies, which interact with various previous intuitive conceptions. It has been shown above how in the historical development of linear algebra the understanding of this fact was essential for the construction of the concept of rank and partly of duality. In teaching, this questioning has to be made explicit, if we do not want misunderstandings to persist. Therefore even at the lowest levels of the theory the question of formalism has to be raised in interaction with various contexts where the students have built previous intuitive conceptions. The construction of a formal approach right from the beginning is a necessary condition for understanding the profound nature of the theory of vector spaces. In this sense, formalism has to be introduced as the answer to a problem that students are able to understand and to make their own, in relation to their previous knowledge infields where linear algebra is relevant. These include at least geometry and linear equations but may also include polynomials or functions, although in those fields one may encounter more difficulties.

Conclusion

The theory of vector spaces is a unifying and generalizing theory, in the sense that, historically, not only did it allow solving new problems in mathematics, but it essentially unified tools, methods and results from various backgrounds in a very general approach. Thus its formalism is a constituent of its nature. Yet all the problems our students may solve with this theory could be solved with less sophisticated tools which they have already learnt (or at least are supposed to have learnt). Therefore the gains of this unification and generalization have to be understood by them, if we want them to accept this formalism and to use the theory correctly. General talk on the subject will not improve the situation, instead we have to build teaching sequences in which this idea can be understood in relation to a mathematical activity so that it becomes a personal reflection of the students. The students must be able to see the relationship between their knowledge and intuition in concrete contexts and the formal language of the theory of vector spaces. In our research work, we state that one must use this "meta-lever" to bring students to a personal understanding of the unifying and generalizing nature of the theory of vector spaces (see Dorier et al. 1995a and b; Dorier 1992, 1995b and 1997).


REFERENCES

Cramer, G. (1750): Introduction a l'analyse des courbes algebriques, Geneve: Crameret Philibert.

Dorier,J.-L. (1990a): Contribution a l'eude de l'enseignement a l'universite des premiers concepts d'algebre lineaire. approcheshistorique et didactique, These de Doctorat de l'Universite J. Fourier - Grenoble 1.

Dorier, J.-L. (1990b): Analyse dans le suivi de productions d'etudiants de DEUG A en algebrelineaire, Cahier DIDIREM no6, IREM de Paris VII.

Dorier, J.-L. (1991): Surl'enseignement des concepts elementaires d'algebre lineaire a l'universite, Recherches en Didactique des Mathematiques11(2/3), 325-364.

Dorier, J.-L. (1992): Illustrer l'Aspect Unificateur etGeneralisateur de l'Algebre Lineaire, Cahier DIDIREM no14,IREM de Paris VII.

Dorier J.-L. (1993):L'emergence du concept de rang dans l'etude des systemes d'equations lineaires, Cahiers du seminaire d'histoire desmathematiques (2e serie) 3 (1993), 159-190.

Dorier, J.-L., RobertA., Robinet J. and Rogalski M. (1994a): L'enseignement de l'algebre lineaire en DEUG premiere annee, essai d'evaluation d'une ingrnierie longue et questions, In Vingt ans de Didactique des Mathematiques en France,Artigue M. et al. (eds), Grenoble : La Pensee Sauvage,328-342.

Dorier, J.-L, Robert, A.,Robinet, J. and Rogalski, M. (1994b): The teaching of linear algebra in first year of French science university, in the Proceedings of the 18th conference of the international group for the Psychology of Mathematics Education, Lisbonne, 4 vol., 4:137- 144.

Dorier, J.-L. (1995a): Ageneral outline of the genesis of vector space theory, HistoriaMathematica 22(3), 227-261.

Dorier, J-L. (1995b): Metalevel in the teaching of unifying and generalizing concepts in mathematics,Educational Studies in Mathematics 29(2), 175-197.

Dorier J.-L. (ed.) (1997):L'enseignement de l'algebre lineaire en question, Grenoble: LaPensee Sauvage.

Euler, L. (1750): Sur une contradictionapparente dans la doctrine des lignes courbes, Memoires del'Academie des Sciences de Berlin 4, 219-223, or, Opera omnia, 3 series - 57 vols., Lausanne: Teubner - Orell Fussli - Turicini,1911-76, 26:33-45.

Frobenius, G. F. (1875): Uber das Pfaffsche Problem, Journal fur reine und angewandte Mathematik 82, 230-315.

Frobenius, G. F. (1879): Uber homogene totale Differentialgleichungen, Journal fur reine und angewandte Mathematik 86, 1-19.

Ousman, R. (1996): Contribution a l'enseignement de l'algebre lineaire en premiere annee d'universite, These de doctoratde l'universite de Rennes I.

Robert, A. and Robinet, J.(1989): Quelques resultats sur l'apprentissage de l'algebre lineaire en premiere annee de DEUG, Cahier de Didactique desMath_matiques 53, IREM de Paris VII.

Rogalski, M. (1991): Unenseignement de l'algebre lineaire en DEUG A premiere annee,Cahier de Didactique des Mathematiques 53, IREM de Paris VII.

Rogalski, M. (1994):L'enseignement de l'algebre lineaire en premiere annee de DEUGA, La Gazette des Mathematiciens 60, 39-62.

Vergnaud, G. (1990): La theorie des champs conceptuels, Recherchesen Didactiques des Mathmatiques 10(2/3), 133-170.



回《數學教育》第 六期目錄
數學教育 第六期 EduMath 6 (7/98)