Языкознание
t-
17
Л.Н. Иорданская, И.А. Мельчук
Упорядочение простых предложений в составе сложного предложения в английском языке1
В статье вводится ряд лингвистических понятий, необходимых для моделирования линейного порядка простых предложений (клауз) в составе сложного предложения английского языка: простое предложение, сложное предложение, семантические и синтаксические тема, рема и спецификатор, синтаксическая и коммуникативная автономность простых предложений. Поверхностно-синтаксическая структура (интерпретируемая в терминах зависимостей) и синтактико-коммуникативная маркировка сложного предложения служат входом для правил упорядочения простых предложений. Предлагаются и иллюстрируются правила перехода от поверхностно-синтаксической и коммуникативной структур сложного предложения к его линейной структуре.
Ключевые слова: английский синтаксис, синтаксические зависимости, коммуникативная структура и порядок слов, простое предложение, сложное предложение.
1. The problem stated
The present paper aims at describing the synthesis (= production) of complex sentences in English from the viewpoint of word order, with an eye to possible generalization to other languages. We consider the transition from
1 The first version of this paper was read and criticized by B. Bohnet, A. Burga and L. Wanner; the remarks by Burga lead to a substantive revision of the text. The next version underwent the scrutiny of M. Alonso Ramos, I. Boguslavskij, A. Burga, S. Kahane, J. Milicevic and A. Polguère; D. Beck took a serious look at the paper and eradicated several clumsy formulations and outright mistakes. Finally, objections and suggestions from J. Rooryck and A. Zimmerling allowed us to introduce additional corrections and make the paper more readable.
We are happy to express our most heartfelt gratitude to all these colleagues and friends.
the meaning of a complex sentence through its syntactic and syntactic-communicative structures to the sentence itself, concentrating on just one aspect -the linear ordering of simple clauses within a complex sentence. Our goal is to propose sentence-synthesis rules determining the order of clauses:
complex sentence ordering linearized
rules complex sentence
{syntactic & communicative structures} ^ {clausej + clause2 + ... + clausen}
As strange as this may seem, there is not much literature on clause ordering in English complex sentences, and there is, as far as we know, no work aimed at specifying clause-ordering rules.1 We are breaking virtually new ground; therefore, we need to introduce several notions and formalisms.
Since we deal with meaning-to-text type rules, this study has been carried out within the Meaning-Text framework.2 However, in order to facilitate the reading, we do our best not to be too theory-specific and we define or, at least, informally explain all the notions that we use.
Meaning-Text synthesis rules are organized into six modules:
1. Semantics: {SemR} {DSyntR}
2. Deep syntax: {DSyntR} {SSyntR}
3. Surface syntax: {SSyntR} {DMorphR}
4. Deep morphology: {DMorphR} {SMorphR}
5. Surface morphology: {SMorphR} {DPhonR}
6. Phonology: {DPhonR} {SPhonR}3
1 The literature on the question
For the queries "clause ordering in English" and "order of clauses in English," the Google search returns merely a couple dozen hits. And most of the references found (curiously, almost all of them by foreign scholars) deal with morphological techniques of marking the semantic dependency relations between clauses and describe the tendencies observed in texts. Thus, they note that subordinate clauses with such and such a conjunction or of a given length tend to precede/follow its governing clause. True, several interesting observations have been made, such as the preference for iconicity, which makes that a temporal clause describing a previous event tends to precede its governing clause talking about the following event [4]. However, this result, no matter how interesting in itself, is irrelevant for clause ordering: the semantic considerations playing out in iconicity concern the construction of the communicative structure of the sentence, not directly the linear arrangement of clauses. Lee [9] explicitly speaks about the role of communicative structure in clause ordering, but only so far as the positioning of an IF-clause is involved: it precedes in case it expresses the Theme. The only text addressing clause ordering in English is, to the best of our knowledge, the book [1, p. 1037ff]. - For Russian, we should mention the pioneering work of E. Paduceva [19].
2 The Meaning-Text approach has been presented in a number of publications, most recently -[12, p. 102ff]; for more specific details on communicative structure and syntax in this framework one can consult [10; 13; 14].
3 Here, -R stands for "representation," the curly brackets { } enclose sets, and ^ means "corresponds". The abbreviations D- and S- refer to deep and surface sublevels of representation,
The fact crucial for the understanding of the subsequent discussion is that we ! work with syntactic structures written in terms of syntactic dependencies rather than in terms of (more familiar to many) phrase-structure formalism (see, for | example, [11]). Note that throughout this paper a dependency is shown with an arrow: A-^B means 'A depends on B'.
The synthesis of a complex sentence is understood here as linear ordering of simple clauses within this sentence, plus prosodization and morphologization of the string of clauses thus obtained, with the necessary mutual adjustment of the clauses being united.1 (The notions of clause and sentence are defined in 2.1).
A simple clause is synthesized out of the lexical units that compose it; they are linked by syntactic dependencies and form a syntactic structure supplied with a communicative structure. Analogously, a complex sentence is synthesized out of the simple clauses that compose it - according to the syntactic links between them and their communicative characteristics. These are two mutually independent linguistic operations, which must be described separately. In what follows, the former - the synthesis of a simple clause - is taken
respectively; Sem-, Synt-, Morph- and Phon- stand for "semantic", "syntactic", "morphological", and "phonic". Thus, semantics, or the semantic module of a language, establishes correspondences between the set of semantic representations {SemR} of sentences and the set of deep-syntactic representations {DSyntR} of sentences, and so forth.
1 Mutual adjustment of clauses within a complex sentence
When two simple clauses synthesized in isolation are combined within a complex sentence, the following three types of modification of the clauses being combined have to be foreseen (depending, of course, on the language):
1. Lexical modifications, such as, for instance, adding the lexeme then at the beginning of the superordinate clause (= apodosis) of a conditional complex sentence that expresses a logical implication, if the subordinate clause (= protasis) precedes:
(i) The two starting clauses are (a) emigration will stop and (b) if the quality of life goes up. In case the clause (b) precedes, we can have If the quality of life goes up, then emigration will stop .
2. Morphological modifications, such as, for instance, adding a special marker to the noun modified by a relative clause:
(ii) Persian
The two starting clauses are (a) ketab in%ast lit. 'book here.is' and (b) ki anra mixanam lit. 'that itdilo I.am.reading'. The clause (b) is a relative depending on the noun ketab 'book'. When the two clauses are put together, this noun must - except for some particular cases - take the suffix -i:
Ketab + i ki anra mixanam in%ast 'The/A book that I am reading is here'.
3. Prosodic modifications, such as, for instance, changing the final prosody of a stand-alone clause to a continuation prosody in case this clause precedes another clause within the sentence:
(iii) The two starting clauses are
\ \ (a) emigration will stop and (b) if the quality of life goes up. In case the clause (a) precedes, we have
Em.grat.on w.U stOp | .f the quality of l.fe goes ¿p.
for granted; we deal only with the latter - the synthesis of a complex sentence; a synthesized simple clause is manipulated as a block.
This paper only presents some rules of Module 3 (that is, of surface syntax) of a linguistic Meaning-Text model - namely, the rules of the {SSyntR} ^ {DMorphR} transition responsible for the ordering of simple clauses within a complex sentence. To formulate these rules, we need the notions of dependent, or subordinate, clause [SubC], and governing, or superordinate, clause [SuperC].1 For simplicity's sake, we will say that a SubC syntactically depends on its SuperC, meaning that it is the SubC's syntactic head that syntactically depends on a lexeme in its SuperC. (The head of a SubC is either a subordinate conjunction, if it is present, or the finite verb, if there is no conjunction, as, for instance, in a relative clause.)
We operate from the following premise: Just as with the ordering of wordforms within a simple clause [15], the ordering of clauses within a complex sentence can be described in accordance with direct syntactic dependencies between clauses: roughly speaking, it can be reduced to the linear placement of a subordinate clause with respect to its superordinate clause.
In English, a SubC can precede or follow its SuperC or be placed inside it; a SubC cannot include its SuperC. Therefore, the following three linear arrangements of two syntactically linked clauses are possible: SubC + SuperC, SuperC + SubC, and SuperC- + SubC + -SuperC (a SubC is "inserted" into its SuperC). For instance:
(1) a. When war broke out, the stock market went up.
b. The stock market went up when war broke out.
c. The stock market, when war broke out, went up.
Considered in isolation, the complex sentence illustrated in (1) allows all three theoretically possible arrangements of its two clauses. However, within an actual text, this is not necessarily the case: some clause arrangements in a given complex sentence can be constrained by the previous context. Thus, arrangements (1b-c) are inappropriate if they follow the sentence 1941 saw the Japanese attack on Pearl Harbor (the symbol "*woX" means that the sequence X is bad because of word order):2
1 Following [1], we prefer the terms superordinate vs. subordinate (clause) to the more current terms main vs. subordinate. The term main is not relational: it characterizes the properties of a clause rather than its relationship to another clause. Thus, clause C2 being subordinate to clause C1 can itself have its own subordinate C3; however, C2 is not a main clause.
2 As for arrangement (1a), it seems to be possible in any context; for instance:
(i) 1941 saw the Japanese attack on Pearl Harbor. When war broke out, the stock market went up.
(2) b'. 1941 saw the Japanese attack on Pearl Harbor. *woThe stock market § went up when war broke out. I
c'. 1941 saw the Japanese attack on Pearl Harbor. *woThe stock market, J= when war broke out, went up.
These complications of the linear disposition of clauses are related to the communicative structure [CommS], a.k.a. Information Structure, of the sentence, so that the latter will be paid special attention. The CommS of a sentence to be synthesized is established by the Speaker - as a function of his intentions and of the preceding context - in order to make the text coherent and understandable.1 Our task in this paper is, as already said, to formulate the rules that, based on the syntactic structure and the syntactic-communicative structure of a complex sentence, determine the linear order of the clauses within it.
The paper is organized as follows: after defining some necessary notions -syntactic (2) and communicative (3) - and considering syntactic vs. thematic division of a complex sentence (4), the rules for clause ordering are presented (5), and their application is illustrated (6); finally, some conclusions are formulated (7).
Our starting data are a few hundred English sentences from the Penn Tree-Bank annotated by Alicia Burga and Leo Wanner with partial Comm-struc-tures - namely, specifying Thematic areas (Rheme ~ Theme ~ Specifiers; for the details of this annotation, see [2]). We have added to it more examples, borrowed from actual texts or supplied by ourselves in order to illustrate different possibilities.
In the examples of syntactic and communicative structures, we use shortcuts: elements immaterial for the discussion (such as some grammemes or some labels of surface-syntactic relations) are not shown, certain syntactic nodes are grouped together, internal communicative divisions of certain communicative areas are not specified where irrelevant, etc.
(ii) Share value has the tendency to strongly fluctuate. When war broke out, the stock market went up.
This is so because a temporal subordinate clause can always be interpreted as a Comm-Specifier, see below, 3.2.1.
1 Let us emphasize that computing the Comm-structure of the sentence under synthesis (by the Speaker or by a linguistic model) is a separate task, which we do not consider here. It has to be accomplished before the rules of meaning-to-text transition apply - namely, on the stage of text planning, which ensures the coherence and natural character of the text (see, for instance, [6, p. 283-286]. In the present paper, the Synt-Comm-structure of a sentence is taken for granted. -"The Speaker" (with a capital "S") stands for 'the author of this speech act' - that is, 'I'; speaker (with a lowercase "s") refers to any speaker of a language.
2. Necessary syntactic notions
2.1. Simple clause and complex sentence
Simple clause
L stands for "a given lexical unit." A simple clause is a phrase such that it satisfies simultaneously the following two conditions:
1) It is headed either by a VFIN (= a finite verb), or a CONJ(subord)-subord-conjunct^VFIN (= subordinate conjunction with its dependent VFIN) or a PREP-prepositional^VFIN (= preposition with its dependent VFIN),1 this VFIN having the subject L (VFIN-subjectival^L);
2) It contains all the direct and indirect dependents of the above configurations, except another configuration <<(CONJ'(subord)/PREP'^-) V'FIN-subjectival^-L'».
In prose, a simple clause is defined by the unique subjectival syntactic dependency; the constraint "does not contain another configuration «(CONJ(subord)/PREP^-)V'FIN-subjectival^L' ...»" ensures this uniqueness, while allowing for conjoined VFINs that share the same subject (John reads a paper and drinks coffee is a simple clause). For instance (the borders of simple clauses are shown by curly brackets): (3) a. {The stock market^subj-wentn up,}
v fin clause1
I—subord-conjunct—
{when war ^subj-broke, n out in 1941.}
<• conj j v2fin ' clause2
^-subj -1
b. {The shares {we -^subj-had 2 bought} went , up.}
<- V2fin b clause2 V1FIN clause,
c. {^e-^subj-won't , apologize -
V fin
I-prepositional
for what he^subj-said 2 } }
J J V2fin clause^ CLAUSE,
Such "incomplete" phrases as McGuire declared [that the stock market had gone up] or [What we found] proved important are treated as simple clauses; this corresponds to the traditional approach.
The proposed definition of a simple clause is valid for languages in which a clause necessarily contains a finite verb. To generalize it to all languages, the reference to VFIN must be replaced with a detailed description of possible syntactic heads that, along with a VFIN, accept a subject (predicative adjectives
1 This is a so-called headless relative clause introduced by a preposition, such as from where
he was standing or on what she said. These clauses will be discussed in connection with the rule that describes them; see 5.2.4, rule 9, p. 45.
in Japanese and Korean, predicative forms of nominal lexemes in Turkic languages, etc.).
A complex clause can be defined as a set of syntactically linked simple clauses. However, the distinction between a simple and a complex clause can be ignored in the present study, so that we will omit the adjective simple and speak simply of clauses.
Along with genuine clauses, English has what can be called quasi-clauses. A quasi-clause is one of (at least) the following regular reductions of a genuine clause:
- an absolute construction with or without a preposition, see (4);
- a participial construction with a subordinate conjunction, see (5);
- an infinitive construction with an interrogative-relative pronoun, see (6).
(4) a. {Sixty percent of the fund will be invested in stocks,}
{with the rest going into bonds.}
1 ° ° 'clause2
b. {The government abolishes its golden share in Jaguar, }CLADSEi
{the luxury auto maker being stalked by General Motors.}
1 s ° s ' clause2
(5) {When offering $1,250 to retired major leaguers,} CLADSEi
{they lost the chance of another season. }clause2
(6) {Everybody knows}clause1{whom to blame.}clause2
Since quasi-clauses behave in many respects like genuine clauses, they are considered, by analogy, to be a subset of clauses; however, they feature a number of particularities, and for the moment, our clause-ordering rules do not cover them.
Now let us switch to sentences. A sentence is an utterance that constitutes a stand-alone communicative unit; it may be or not be a clause - just as a clause may be or not be a sentence: for instance, "nominative" sentences (London. November. A rainy night. Empty streets.) and exclamatory sentences of a special type (What a horrible weather!) are not clauses; subordinate clauses are not sentences. If a sentence contains just one clause it is called a simple sentence; otherwise, it is a multiclausal sentence. Multiclausal sentences are either compound sentences, which consist of coordinated clauses (clause^-coord—► clause2), or complex sentences, which are our target here. Complex sentence
A sentence is a complex sentence if and only if it consists of at least two simple clauses one of which is syntactically subordinated to the other:
CLAUSEj-SUbord—CLAUSE2.
Here, clausEj is a superordinate clause [SuperC] for clause2, and clause2 its subordinate clause [SubC]. The top node of a SubC depends on a lexeme in its SuperC by a non-coordinate relation:
circumstantial
(7) a. {The stock market went up significantly,} {when war broke out in 1941.}
> C
' clause2
I— relative —^
b. {The shares {we had bought in 1941, went up significantly.}
r & j s j clause1
The notions of "superordinate clause" and "subordinate clause," as was stated above, are relational: "superordinate" means 'syntactic governor', and "subordinate" means 'syntactic dependent' in a given pair of syntactically linked clauses; a clausr can be superordinate with regard to clause^ being, at the same time, subordinate with regard to CLAUSEk.
A complex sentence can contain any number of syntactically linked clauses; for instance, sentence (8) contains five simple clauses, whose syntactic links are shown in Figure 1 (clauses are numbered in linear order; a clause is identified by a subscripted circled number):
(8) ç^{But McGuire said
{that, {while the S.E.C. regulates} {who files,} the law tells them}
®X ® ® ® ® ®
{when they can do so.} © ©
CLAUSE,
0
1
dir-obj
O CLAUSE2 / \ circum dir-obj / \
CLAUSE3 O O
,. I .. CLAUSE5 dir-obj
\
O
CLAUSE4
Fig. 1. A simplified syntactic structure of sentence (8) 2.2. Syntactic autonomy of subordinate clauses
The linear position of a SubC with respect to its SuperC is linked to an important property of the SubC: its syntactic autonomy/non-autonomy. These two syntactic types of subordinate clauses were introduced in [18, p. 466: neprislovnyepridatocnye (= our autonomous subordinate clauses)], whose linear position is established with respect to the whole superordinate clause, and prislovnye pridatocnye (= our non-autonomous subordinated clauses), positioned with respect to a word inside the superordinate clause.
Syntactically autonomous/non-autonomous subordinate clause A SubC is syntactically autonomous if and only if:
1) it is headed by a semantically full subordinate conjunction or its constructional equivalent;1
2) it syntactically depends on the Main Verb of its SuperC. Otherwise, a SubC is syntactically non-autonomous.
Thus, in (9a) the circumstantial SubC is syntactically autonomous, while the SubC in (9b-d) is not: in (9b), the SubC does not have a conjunction; in (9c), the conjunction is semantically empty; and in (9d), the SubC introduced by a full conjunction (SINCE) depends not on the Main Verb, but on an infinitive (DO): (9) a. When war started, John left.
b. The war that had started this year lasted several weeks.
c. He said that the war had started.
I—circum—^
d. The agreement to do this, since it seemed right to all, was reached almost instantly.
The difference between syntactically autonomous and non-autonomous SubCs is relevant for the linear placement of a SubC:
A syntactically autonomous SubC is linearly placed under the control of the syntactic-thematic structure of the complex sentence (see 3). A syntactically non-autonomous SubC is placed in accordance with the surface-syntactic relation that links it to its governor. The communicative information is much less relevant and plays mainly in colloquial language; in our rules, it is not accounted for.
1 Semantically full conjunctions
A semantically full conjunction carries meaning; it has its source in the semantic structure of the sentence. A semantically empty conjunction has no meaning and has no source in the semantic structure; it is introduced into the surface-syntactic structure of the sentence by a surface-syntactic rule (e.g., that in John knows that I am in town. or as in The market, as the press release states, is in turmoil.).
The meaning corresponding to a semantically full conjunction can be expressed by a meaningful syntactic construction - for instance, the irrealis construction seen in the sentence Had he told the truth, he would be in a better position now. Technically, such a construction is represented in the deep-syntactic structure by a fictitious lexeme - a subordinate conjunction «IFin», which gives rise, in the surface-syntactic structure, to the construction itself. In the present paper, we do not consider this type of construction.
Some semantically full subordinate conjunctions have homophones of three types that appear exclusively in syntactically non-autonomous SubCs:
- Semantically empty conjunctions, which are automatic complementizers: for instance, ifii in sentences such as John asked me if I had already met Mary.
- Semantically empty pronominal relators: for instance, whenii.1 in sentences such as The day when I arrived in Montreal was cold, but sunny.
- Semantically full relative-interrogative adverbs: for instance, whenii.2 in sentences such as Everybody knows when I arrived in Paris.
j
2.2.1. Syntactically autonomous subordinate clauses
m
g At the semantic level, the meaning of a future Synt-autonomous SubC is
I linked to the meaning of its SuperC through a semantic configuration 'o' that takes the meanings of both clauses as its Sem-actants, the meaning of the SuperC being its SemA 1: 'SubC' ^2-'o'-1^ 'SuperC'. At the syntactic level, the configuration 'o' can be expressed by a subordinate conjunction, which heads the SubC; this is the case that interests us in this paper. The conjunction in question is subordinated to the Main Verb of the SuperC by the circumstantial SSynt-relation. In the SSynt-structure, an autonomous SubC can be only circumstantial. (The more fine-grained classification of circumstantial SubCs accepted in traditional grammar - temporal, causal, conditional, concessive, etc. clauses - is taken care of by addressing, when needed under synthesis, the meaning of the subordinate conjunction.)
Sentence (10) has the semantic structure shown in Figure 2 and the surface-syntactic structure shown in Figure 3.
(10) The shares went up when the war broke out.
'war' "break out " CX—1—O <-2-
rgo up ' 'shares'
-» 0-1—>o
'after '
- O— *
1 l
o
'immediately' Fig. 2. A simplified semantic structure of sentence (10)
The semantic configurations corresponding to the SubC and the SuperC are boxed in the diagram; they are linked by the configuration of semantemes 'o' = 'immediately after', which is the source of the semantically full subordinate conjunction wheni.3 (the lexicographic number is borrowed from LDOCE Online).
rGOUP\iN WHENI.3 9 circum-
subj subord-conjunctional
O o 'BREAK OUTVi
SHAREp
subj I
o WARsg
Fig. 3. A simplified surface-syntactic structure of sentence (10)
2.2.2. Syntactically non-autonomous subordinate clauses §
q_
As indicated in the definition of syntactic autonomy, a non-autonomous | SubC either has no full subordinate conjunction or does not depend on | the Main Verb of its SuperC. Three cases must be distinguished:
- the SubC has no explicit lexical marker of subordination; it can be a conjunc-tionless completive clause (11a), a Direct Speech clause (11b), and a con-junctionless parenthetical clause (11c);
- the SubC has an explicit lexical marker of subordination - an empty complementizer that (12a), an empty parenthetical introductor as (12b), a WH-word (= interrogative-relative pronoun; e.g., which/what) as a dependent in its clause, see (12c), or a WH-word as the governor in its clause, see (12d);
- the SubC is introduced by a semantically full conjunction, but does not depend on the Main Verb of its SuperC, see (9d).
(11) The SubC includes no explicit marker of subordination
a. John said the stock market had gone up.
b. John said, "The stock market had gone up."
c. The stock market, the government believes, will go up.
(12) The SubC includes an explicit marker of subordination (semantically empty)
a. John said that the stock market had gone up.
b. As John said, the shares went up}
c. (i) The shares which John had bought a year before went up.
(ii) What John had bought a year before turned out to be profitable.
d. John left {only Mary knows why. }2
in the surface-syntactic structure of a complex sentence, a non-autonomous SubC can be of any possible syntactic type: a clause that is a SSynt-actant (the Subject, the Direct Object, etc.), a parenthetical clause, a relative clause, a circumstantial, etc. (see 5.2.4).
3. Necessary syntactic-communicative notions
3.1. Introductory remarks
Roughly speaking, a sentence has two communicative structures: one at the semantic level, a semantic communicative structure [CommS], and another
1 The subordinate conjunction AS introducing a parenthetical clause is empty in this context - it does not have a semantic source. (This AS is different from several semantically full conjunctions AS with the meanings 'while', 'when', 'because'.)
2 The last type of SubC are clauses built out of a WH-word on which depends, by the WH-rela-tive SSynt-relation, an incomplete finite clause of a particular structure: Only your mother knows ^WH-rel—who was coming or He lives you will^WH-rel— [never guess]—where.
one at the syntactic level, a syntactic CommS. The semantic CommS of a sentence is imposed on its semantic structure and characterizes the communicative organization of its meaning. The syntactic CommS of a sentence is imposed on its syntactic structure and characterizes the communicative organization of the sentence itself. The Synt-CommS is derived from the Sem-CommS.
The Semantic-Communicative Structure of a sentence was described in [10]. It is specified by the indication of Semantic-Comm-areas of a semantic structure - that is, of semantic subnetworks, which are marked by the values of eight Sem-Comm-oppositions. Namely, for each Sem-Comm-area, the corresponding value of the given Sem-Comm-opposition is indicated. However, the present paper is limited to the Syntactic-Communicative Structure of a sentence, where five Synt-Comm-oppositions are used: Synt-Thematicity, Synt-Givenness, Synt-Focalization, Synt-Perspective, and Synt-Emphasis [13, p. 8-10]. Out of these, only Synt-Thematicity will be considered in what follows.1
3.2. Syntactic-Communicative Thematicity
The Synt-Comm-opposition of Thematicity has three values: Synt-Rheme, Synt-Theme, and Synt-Specifier; each Synt-thematic area carries one of these values. The Synt-Rheme and the Synt-Theme roughly correspond to the Topic and Focus of "information structure", well known from, among others, the works of the Prague School and those of M. Halliday and K. Lambrecht. The Synt-Specifier seems to be a rather novel notion (thus, it is not mentioned, in any shape, in [3]; however, "setting" and "specification" of J. Firbas [5] clearly are its antecedents).
3.2.1. Synt-Rheme, Synt-Theme, Synt-Specifier: Definitions Syntactic Rheme
The Syntactic Rheme [RSynt] of a clause is the subtree of its syntactic structure that implements its Semantic Rheme - that is, corresponds to the chunk of the sentence's meaning that the Speaker presents as being communicated to the Addressee. Just as the RSem, the RSynt is necessary and unique in each clause; a linguistic expression that does not have an RSynt is not a clause: it is a non-clausal phrase.
1 The system of notions proposed in [10] for the description of semantic-communicative structure of sentences is crucial for the present study. Nevertheless, we are in no position to expound, explain or justify it here; it has to be accepted as a postulate.
Syntactic Theme
The Syntactic Theme [ts ] of a clause is the subtree of its syntactic structure that implements its Semantic Theme - that is, corresponds to the chunk of the sentence's meaning that the Speaker presents as something on which the Semantic Rheme bears. The TSynt is also unique in a clause, but it can be absent. In such a case, we have an all-rhematic clause (e.g., There is a polar bear over there.). Such a clause contains only an RSynt (and perhaps some Synt-Specifiers, see below), but has no TSynt. Syntactic Specifier
A Syntactic Specifier [SPSynt] of a clause is the subtree of its syntactic structure that implements one of its Sem-Specifiers - that is, corresponds to the chunk of the sentence's meaning that the Speaker decided not to include either in the Sem-Rheme or in the Sem-Theme, but to present as a characteristic of the whole situation described by the clause or of the relation between two situations described by the complex sentence. An SPSynt is not necessary and not unique in a clause: a clause can contain no
SPSynt or several SPSynts. SPSynts include:
- circumstantials of time, location, cause, goal, conditions, manner, result, source of information, etc., which characterize a situation;1
- semantically full subordinate conjunctions, such as when, because, although, SINCE, rAS if1, etc.;
- parenthetical expressions, such as unfortunately, curiously, i am afraid,
AS EVERYBODY KNOWS, etc.
All such elements do not belong to the Rheme or the Theme. To sum up: The Synt-Thematic structure of a clause has either one RSynt, or one RSynt and one TSynt; it may also include any number of SPSynts.
Let us illustrate the Synt-communicative notions just introduced with several sentences that express the same propositional meaning. A Synt-Thematic structure characterizes a sentence that is to appear in a particular context. In the examples, we represent this context using, as has been done for decades, an imaginary question Q to which the sentence under analysis can be a felicitous answer. Thus, consider the meaning in (13) and its different sentential implementations in (14) determined by different Sem-Them-structures imposed on this meaning and the different Synt-Them-structures of the corresponding sentences.2
1 Of course, not every syntactic circumstantial corresponds to a SPSynt, but only those that express a meaning not included by the Speaker in the Rheme or the Theme.
2 For simplicity's sake, we illustrate the Synt-Them-structure on the sentence itself rather than on its syntactic structure. This entails some inaccuracies, which are, however, not material to this paper.
(13) 'war' 'begin' 'whenI.3' 'increase' 'value' 'shares'
o^—1-cx-2—5-1-> o—1->o-1-> o
(14) a. Q: How did the war affect the economy?
[When war broke out A [the stock market went up.l
T Synt RSynt
b. Q: What happened to the economy?
[ When war broke out,]m [the stock market went up.l„
SPSynt RSynt
In a context with no mention of the war, the subordinate WHEN-clause in (14b) expresses an spSynt.
(14) c. Q: When did share prices go up?
[The shares went up]T [when war broke out.]„
T Synt RSynt
d. Q: What happened to share prices?
(i) [The shares]T [went up when war broke out.]„
T Synt RSynt
(ii) [The shares]Ts ^ [went up]Rs ^ [when war broke out.]SPs ^ 3.2.2. Synt-Thematic structure:properties
A Synt-Them-area can be further divided into subareas: the Synt-Them-divi-sion is - as the Sem-Them-division - recursive, so that a Synt-Them-structure forms a hierarchy. Two subareas of the same area are said to be at the same level of Synt-Them-hierarchy. Three important remarks seem appropriate at this point.
1. The Synt-Them-division applies to one of the two basic syntactic formations - a whole sentence or a clause.1
2. Any clause must undergo the Synt-Them-division (otherwise, its meaning is implemented as non-clausal phrase).2
3. A whole sentence does not form a Synt-Them-area.
1 Thematically divided non-clausal phrases
There is also an additional special case - a thematically divided non-clausal phrase. A thematically divided phrase is a verbal phrase with two or more actants or circumstantials whose linear ordering is determined communicatively rather than syntactically. For instance, the Rhemes of sentences (i) and (ii) are thematically divided phrases; the different internal thematic organization of these Rhemes determines different word order in them:
(i) [This mea.sure]T^ [[wouldgive many executives^^^
[the opportunity to report trades in their own companies' shares.]R(R^t]Rs
(ii) [This mea.sure]T^
[[wouldgive the opportunity to report trades in their own companies' share.s]TR^^
[to many executiveS.] R(R)Sjnt] RSjnt
2 A particular case of the Synt-Them-division of a sentence is a sentence that consists of a Synt-Rheme only (an all-rhematic, or thetic, sentence); schematically, {[■■■]Rs nt}, where curly brackets show the borders of the sentence and square brackets, those of the Synt-Rheme area.
The Synt-Them-division of a clause has three important properties:
- it is exhaustive: each lexical unit in the clause's SyntS belongs to one of the Synt-Them-areas;
- it is a partition: a lexical unit can belong only to one Synt-Them-area;
- it is recursive (as indicated above): generally speaking, an RSynt, a TSynt and an spSynt allow for an internal Them-division; for instance, an RSynt can be
ditfded, in its tolD, into RjyntORjyntX TSynt(RSynt) and SPSynt(RSynt), etc
In examples below, the borders of a Synt-Them-area are identified with square brackets subscripted with the Them-value of this area; thus, the notation "[ ]Tg " designates a Synt-Theme area. Communicative recursion is shown by an explicit indication of the superordinate area: the notation "[ ]T(R)s " designates the Theme subarea of a Rheme area. Recall that clauses are enclosed in numbered curly brackets. For a clause which covers precisely one Them-area (and which does not constitute a whole sentence), the area's square brackets enclose the clause's curly brackets: [{...}], as in (15c-d); for clauses that do not coincide with one Them-area, brackets appear in an inverted order, see (15b), CLAUSE1.
(15) a. [In December 1941,L„ [the shares]T [went up.]„
SPSynt 1 Synt RSynt
b. ®{[This measure]T^ ^ [[exempts many executives]TR^ ^ [from reporting such trades,]RR^ (]Rs J®
[@{[McGuire]T(SP)synt [said]R(SpD(iy],Spsynt
c. [®{[^hen]SP(T)Synt [War]T(T)Synt [br0ke 0ut,]R(T)synt}®]Tsynt [@{[the shares]T(R)synt [went up.]R(R)synt}(]Rsynt
d. [®{[The shares]T(T)synt [Went uP,]R(T)synt}®]Tsynt [@{[when]SP(R)synt [war]T(R)synt [broke 0ut ]R(R)synt}@]Rsynt
e. [®{[When]SP(SP)Synt [war]T(SP)Synt [broke out, ]R(SP)synt ®]SPsynt
({[the shares]Tsynt [went up.]Rgynt}@
f. [®{The shares
({[that = shares]T(T^ ^ [McGuire had bought]^^ (}@]Ts t
[went up.]RSynt}®
g. ®{[The shares, ]Tsynt
[©{[McGuire]T(SPysn5,nt [rePorts,]R(SP)Synt}@]SPSynt[went uP. ]RSynt}®
3.2.3. Synt-Thematic structure: representation
The Synt-ThemS is represented by means of two complementary tools. On the one hand, any Synt-Them-area is shown by boxing the corresponding syntactic subtree; a communicative box - that is, a Synt-Them-area - has a value of Thematicity associated with it.
On the other hand, the Synt-ThemS explicitly shows communicative dependency relations between Synt-Them-values. Recall that the Synt-Rheme, the Synt-Theme and the Synt-Specifier are relational notions: a Synt-Them and a Synt-Specifier are the Theme and a Specifier of a given Rheme; and a given Rheme may have its Theme and Specifiers.
Communicative dependency is represented - as any dependency - by arrows. We use two types of communicative dependencies and, accordingly, two types of arrows:
- The dominance communicative dependency holds between two units of the same level of Synt-Them-hierarchy: between a Rheme and its Theme / its Specifiers; this Comm-dependency is shown by a bold arrow: sp^-r^-t. In this way, the Synt-Them-structure shows explicitly on which Rheme such and such a Theme and such and such a Specifier depends.
- The hierarchy communicative dependency holds between two Them-values of different levels of Synt-Them-hierarchy: between a value of a Them-area of a higher level and the Rheme of a lower level (this Rheme being inside this area). In the Synt-Them structure, it is shown by a double arrow, which always points to a Rheme, since the latter is the "head" of the lower-level Comm-dependen-cy subtree. This is a convenient way of encoding the hierarchical inclusion of Comm-areas. For instance, means that this given Theme has inside of its area a Rheme of a lower level, which may have its own communicatively dominated dependents (a Theme and/or Specifiers). In example sentences, however, this hierarchical dependencyis shown by a different notation: r(t) means "this Rheme's area is included in the area of the higher Theme."
To illustrate the Synt-Them-Structure of a complex sentence, we give Figure 4: a simplified surface-syntactic representation of sentence (16); see next page.
(16) [№"W)synt [war]T(TW [broke ^'Wyn^Synt [[shares]T(R)synt [went ^WyntWt 4. Syntactic division vs. syntactic-thematic division of a complex sentence
As examples (15) show, the division of a complex sentence into simple clauses does not necessarily coincide with its division into Synt-Them areas. The mismatches between clauses and concerned Synt-Them-areas are quite common. All four set-theoretical relations are possible between a clause in a complex sentence and Synt-Them-areas:
- a clause coincides with a Synt-Them-area;
- a clause includes more than one Synt-Them-area;
- a clause is included in one Synt-Them-area;
- a clause is included in two Synt-Them-areas.
Fig. 4. A simplified surface-syntactic representation of sentence (16)
In sentence (17), clauseSj 2 3 6 implement each exactly one Synt-Them-area; clauses4 5 include each two Synt-Them-areas; clauses1 2 are together included in one Synt-Them-area, the same as clauses5 6. A clause "distributed" between two Synt-Them-areas is illustrated in (18): clause1
(17) [[^{Though twenty years have passed}Q]R(SP1)s [^Asince these stories were written, 1™,
H2J1 T(SP1)Synt SP1Synt
[@{/ think}@]Sp2Synt @{[they]xSynt [are not 0ut of date,]RSynt}©
[@{since [@{/ am told }@]Sp(Sp3)Synt [they]T(SP3)Synt [have been
required reading for the Department's personnel.]R(SP3^ t}@]SP3s t
The above mismatches seriously complicate the formulations of clause-ordering rules.
5. Clause-ordering rules
5.1. Characterization of clause-ordering rules
The proposed clause-ordering rules describe the clause ordering in declarative sentences of written English, of neutral formal style. These rules are, as we said above, part of the SSyntR ^ DMorphR module of a linguistic model. As is the case with word order rules for lexemes within clauses [15], clause-ordering rules fall into three groups:
- local clause-ordering rules cover the placement of a SubC with respect to its SuperC (5.2);
- quasi-local clause-ordering rules determine mutual arrangements of several SubCs that are co-subordinated "in parallel" to the same SuperC and positioned, in accordance with local rules, on the same side of it (5.3);
- global clause-ordering rules filter out bad arrangements based on the consideration of the whole SyntS and Synt-ThemS of the complex sentence to be synthesized (5.4).
The input for clause-ordering rules is the complete surface-syntactic representation of the complex sentence to be synthesized, with a specification of simple clauses in the SyntS and full Synt-Thematic structure. Figure 5 gives an example of clause-ordering rule input - a (simplified) SSyntR of sentence (18): (18) Q: What happened to the shares?
0{[The shares]Ts ^ [went up}^ when war broke out.}@]Rs t
rGO UP"1 FIN I r
WHEN I.3 "-►O
subord-conjunctional
I
Q ""BREAK OUT Vin subj
\
WAR s
CLAUSE2
Fig. 5. A simplified SSyntR of sentence (18)
The output of clause-ordering rules is a linearly ordered string of simple clauses supplied, when appropriate, with an approximate indication of prosody: pauses and rising/falling contours. The resulting string of clauses must undergo all the necessary adjustments - lexical, prosodic, and morphological, see Footnote 1 on p. 19; however, the rules for clause adjustment are not considered in this paper.
For the input above, the output provided by the rules given appears as clausEj + clause2; it is obtained by means of rule 4, see below, p. 42.
The information about clauses used by the clause-ordering rules is of four types:
1) the surface-syntactic relation linking the SubC to its syntactic governor. It is explicitly shown in the starting SyntS, going from a lexeme in the SuperC to the head of the SubC;
2) the position of the SubC and its SuperC in the syntactic-communicative structure with respect to the concerned Them-areas;
3) the relevant syntactic properties of some lexemes, for instance, conjunctions: some conjunctions require the anteposition of the clause they introduce (rTHE more1...), some others, the postposition (riN that1...), see below;
4) the length of the SubC in terms of the number of stressed wordforms. The order of application of the clause-ordering rules is, theoretically speaking, irrelevant: the rules are written in such a way that they can be applied in any order, provided they are applied an unlimited number of times. In practice, of course, a convenient order has to be defined; this is, however, a procedural problem, which is left out of consideration in this paper. In the illustrations, we choose the most natural order of rule application.
Notations used in clause-ordering rules
A^B element B depends on element A - semantically or syntac-
tically
SuperC^SubC the top node of the SubC syntactically depends on a lexeme in the SuperC • t/sp t/sp communicatively depends on its r
► B Thematic subarea B is included in Thematic area A
r -
A = C„
C
SuperC-/SubC--SuperC/-SubC L
L L*
X + Y X +...+ Y length (X)
A ^
I, II
clause C either implements RSynt, TSynt, SPSynt or is included in
RSynt, TSynt, SPSynt as the syntactically top clause in this area1
clause C includes RSynt or TSynt, or else is included in RSynt
(without being the top clause)
the initial part of an "interrupted" clause
the final part of an "interrupted" clause
a particular lexeme
lexeme L with its syntactic dependents
lexeme L with its syntactic dependents except the SubCs
considered in this rule
Y follows X immediately
Y follows X, not necessarily immediately
length of the phrase X in terms of number of its stressed wordforms
rising and falling intonation contours2 minor and major pauses
in the right-hand side of a rule, precedes the condition part ("if and only if")
1 As is typical for dependency structures, the top element in a subtree represents the whole subtree.
2 Prosodic indications in our rules are approximate.
In example sentences, we give only the information relevant in each particular case.
5.2. Local clause-ordering rules
5.2.1. General conditions on local clause-ordering rules
Local clause-ordering rules are subject to the following three general syntactic conditions.
Fixed-position conjunction condition
|| If a SubC 3 CONJ(ante)/CONJ(post), then it is positioned before/after its SuperC.
Certain English subordinate conjunctions allow either only for the antepo-sition of the SubC with respect to its SuperC, or for its postposition (by the way, all coordinate conjunctions require postposition). Thus, the conjunction rTHE more1 ,..[rTHE more1 ...] requires the anteposition of its SubC,1 and the conjunctions "as if1, "in that1, rso that1 require its postposition, as well as the relative pronoun which3 (as in Things improved, which surprised me). Clause-ordering rules do not apply to cases where the SubC contains such a conjunction: this SubC is linearly placed according to the corresponding feature of the conjunction.
Fixed-position conjunctions are linked with the Synt-Them-S of the sentence: under lexicalization, the selection of such a conjunction must be licensed by the thematic role of the corresponding Sem-configuration. This happens, however, in the semantic module and should not occupy us here. Clause-border marker condition
Many languages have lexical units that could be called "clause-border markers": conjunctions and relative pronouns, which necessarily occupy the clause-initial position. In English, no lexeme that depends on an element in a conjunction-introduced clause can be linearly placed before the conjunction / the relative pronoun, which is, so to speak, the absolute left border of the clause.
Let L be a lexeme L with all its syntactic dependents; ¥ is either L1 where L1 = (CONJ)/(rel) or L2 where L2 — ^ who/whose ("—means 'syntactic dependency, not necessarily direct').
|| If SuperC begins with then not SubC +.+
1 Our treatment of the conjunction 'the more1 as subordinate (even if it follows a respectable tradition: see [1, 14.13, p. 999]) needs a special justification; here it can be accepted for the sake of discussion as a hypothesis.
If a SuperC begins with ¥ - a conjunction phrase (e.g., only—be- i cause) or a phrase including a relative pronoun (e.g., who—likes... or the letter—for—whom), then its SubC that must precede it according to | the ordering rules cannot be "physically" placed before ¥. This means that a preceding SubC is inserted into its SuperC immediately after ¥. See (19), where ¥ is shown by boldface, "*wo" means 'bad because of word order' and the trespassing clause, whose linear placement violates the Clause-Border Marker Condition, is boxed:
(19) a. ({McGuire said}( ({that,({although the war had broken out,}( the shares had gone up.}( vs.
*wo({McGuire said}(
({although the war had broken out,}(
({that the shares had gone up.}(
b. ({McGuire, ({for whose father, ({as everybody knows,}(
John bought this house,}( decided to quit.}( vs.
*wo({McGuire, ({as everybody knows,}(
({for whose father John bought this house,}( decided to quit.}(
In (19a), the SubC(, which - by rule 3.1 - can precede its SuperC(, is not placed before the conjunction that, but immediately after it. (19b) shows the relevance of the condition that ¥ includes who or whose.1
Clause insertion condition || A SubC can be linearly inserted in its SuperC under specific conditions.
For instance, a SubC inserted into its SuperC, generally speaking, cannot interrupt a complete nominal word group - that is, a previously linearized saturated nominal phrase, shown below in boldface (on the word groups, see [15]); such an interruption leads to disallowed non-projectivity. (However, in some cases, such an insertion is allowed for a parenthetical clause.)
1 Clause-Border Marker Condition follows from the ban on non-projectivity (see below, 5.4, rule 20), so that formally it is superfluous. However, linguistically, it is important, since it explicitly expresses a relevant fact about English. In Latin, for instance, a focalized element of a subordinate clause could be placed before the subordinate conjunction: Hanc [= paludem] si nostri transirent, hostes expectabant lit. 'It [= a swamp] whether our.soldiers cross, the.enemies were. waiting'. = 'The enemies waited for our soldiers to cross it' [Caesar, De bello Gallico].
(20) {John's shares, when the war broke out,}^ in General Motors went up sharply.
The inverse insertion - of a SuperC into its SubC - is impossible; it is blocked by rule 20.
5.2.2. Major types of local clause-ordering rules
Local clause-ordering rules are subdivided in two groups: rules for a syntactically autonomous SubC (5.2.3) and those for a syntactically non-autonomous SubC (5.2.4).
The rules of the first group deal exclusively with a circumstantial SubC that syntactically depends on the Main Verb of its SuperC. It is linearly positioned as a function of Synt-thematic information - with respect to the whole SuperC; therefore, rules of this group have the symbol "SuperC" on their left-hand side.
The rules of the second group cover all types of SubC (including circumstantial SubCs of a special type: rules 13 and 14); the type of SSynt-dependency of the SubC is crucial here. Such a SubC is positioned with respect to its syntactic governor L - a particular lexeme within the SuperC; the left-hand side of a second-group rule necessarily contains the symbol "L".
This division of clause-ordering rules corresponds to the intuition that a SubC syntactically depends either on the "whole" SuperC (which, under a dependency approach, means the dependency on the top node of the SuperC), or on an individual lexeme within the SuperC (which can, but does not have to, be the top node of the SuperC).
5.2.3. Local clause-ordering rules for syntactically autonomous subordinate clauses
Local clause-ordering rules target a syntactically autonomous SubC (which
contains a semantically full subordinate conjunction or a meaningful syntactic construction, equivalent to such a conjunction). All syntactically autonomous SubCs are circumstantial. (But not all circumstantial SubCs are syntactically autonomous!)
The placement of a syntactically autonomous SubC with respect to its SuperC is determined by the Them-characteristics of both clauses. From this viewpoint, a SuperC-circum^ SubC pair presents five attested combinations:1
SuperCT-circum^SubCR _R
SuperCR-circum^SubCT R_T
1 Logically, more combinations are possible (e.g., the SubC can be part of the TSynt area), but we did not find the corresponding examples.
SuperCR-circum ^SubCSF I_sp
SuperC3R-circum^SubCSF 1_dP
SuperC3T-circum^SubCcR T_1
These cases are covered by four local clause-ordering rules: rules 1-4. As is to be expected, in English the Theme-clause precedes the Rheme-clause - with one exception (accounted for in rule 2.2): an indefinite Rheme precedes its Theme.
RORD 1: Placement of SubCR
RSynt
A circumstantial Sub-clause that implements an RSynt is placed after its Super-clause that implements the corresponding TSynt.
/ \ SuperCT-circum^SubCR ^ SuperC || + ...+ SubC T_R
(21) a. [({The shares went up,}(]T [({when war broke out.}(]R
b. [({McGuire bought the shares,
({which were going down,}(}(T [({when war broke ouZ.}(]r
In (21b), the circumstantial SubC(is placed after its SuperC(, which is the top clause in the TSynt area. The relative SubC( is positioned - in conformity with rule 12.1 - after the noun sharepl (not after the whole SuperC(, so that SubC( finds itself inside its SuperC(); therefore, there is no competition between SubC( (examined in this rule) and SubC( for the position after the SuperC(. (21) c. [({[The shares]T(T) [went up,]R(T) }(
[({although the opposite was expected,}(]SP(T) ]T [({when war broke out.}(]R
In (21c), the SubC under examination (= clause3) must follow its SuperC (= clausEj); but there is competition - between clause3 and clause2 (which also can follow clausEj : according to rule 3.2) - for the position immediately after the SuperC. The correct placement of clause3 is ensured by the global rule 22, which forbids the Rheme to interrupt the Theme, see (46a).
r°rd 2: Placement of SubCT
TSynt
A circumstantial SubC that expresses a TSynt can always be placed before its SuperC that expresses the corresponding RSynt, or, in a special case, after it - under the condition that this RSynt is indefinite.
/ \ SuperCR-circum^SubCT ^ 1) SubC || +...+ SuperC
R-T \ \ / \ .
^ 2) SuperC || +...+SubC | R is indefinite
An RSynt is considered indefinite if it contains such indefinite pronouns as
SOME, ONE, etc.1
(22) a. RORD 2.1
(i) [{Whet
(ii) [{But, [{when the war broke out,}]T the shares went up.}]R
(i) [{When the war broke out,}]T [{the shares went up.}]R
TSynt RSynt
Synt Synt
The correct placement of the SubC after but in (22a-ii) is ensured by the Clause-Border Marker Condition (General condition 2 above).
(22) a. (iii) [q{ When McGuire specified} ^
who can be selected,} ,^]T everybody was happy.}^]R
(iv) [^{When the war broke out
after the note had been rejected,}@]T [@{the shares went up. }@]R
In (22a-iii/iv), the correct anteposition of clause1 together with its dependent clause2 is ensured by the global rule 20, which filters out cases of forbidden non-projectivity.
(22) a. (v) [^{When the war broke out,}^]T
[g){ although the opposite was expected,}@]S [gj{the shares went up. }@]R
In (22a-v), unlike the preceding cases, clause2 syntactically depends on clause3 rather than on clause1. Therefore, we have here a co-subordination of clause1 and clause2; the mutual order of these two co-subordinated SubCs is free: both possible arrangements are correct.
(22) b. RORD 2.2
(i) Q: What is to be done to accommodate the peculiarities of crops? [Some modifications may be necessary,]R [if we want to accommodate the peculiarities of each crop.]T
JSPSynt
1 The anteposition of an indefinite Rheme also takes place at the simple clause level:
(i) The airfield was quiet. [Threeplanes]r [were sitting on the tarmac.
(ii) Q: How long ago were these stories written? [Twenty years have passed]R [since these stories were written.]T
ASynt
RORD 3: Placement of SubCSPSynt
A circumstantial SubC that implements an spSynt is placed either before, or after, or within its SuperC, which implements or includes the corresPonding RSynt-
In other words, the linear position of an SPSynt is considered to be optional. In fact, there could be additional factors at play, such as Givenness or length (= heaviness), etc., but we have not studied those.
'' " is an abbreviation for the disjunction " or ^ ".
—RSynt J RSynt 3RSynt
/ \
SuperC—R-circum^SubCSP ^ 1) SubC || +...+ SuperC
I-sp / \
^ 2) SuperC || +...+ SubC
^ 3) SuperC- | +...+ SubC | +...+ -SuperC1
(23) a. RORD 3.1
[{When the war broke out} {after the treaty was rejected,}]Sp {[the shares]T [went up.]R }
b. RORD 3.2 Synt Synt
{[The shares]T [went up,]R } [{when war broke out.}]Sp
c. RORD 3.3 Synt Synt Synt
(i) {[The shares,]T [{when war broke out,}L„ [went up.]„ }
TSynt SPSynt RSynt
(ii) *wo{[The shares, [{when war broke out,}]™
SPSynt
{which John had bought a year ago}]T [went up.]R }
Sentence (23c-ii) is incorrect because the SubC separates a relative SubC from its syntactic governor, which is forbidden by the quasi-local rule 17. Cf. the correct sentence in (23c-iii):
(23) c. (iii) {[The shares {which John had bought a year ago, }]T [{when the war broke out,}]
SPSynt
[went up immediately and reached a level never seen before.]R }
In all the sentences in (23), the SuperC does not implement a Them-area (but includes the RSynt); in (24), the SuperC is RSynt:
1 1. As indicated in Notations, "SuperC-" and "-SuperC" stand, respectively, for the initial and the final parts of the SuperC.
2. Constraining conditions are needed to determine the exact point of insertion for the SubC.
(24) a. RORD 3.1
(i) [q{ When the war broke out,}q]sp
[g,{because production doubled,,}@fT [({the stock market went up.}(]R Synt In (24a-i), the mutual disposition of the two co-subordinated SubCs, clause1 and clause2, is not constrained by quasi-local clause-ordering rules.
(24) a. (ii) [{When the war broke out} ^ {after the treaty was rejected,}]SP
[{the influenza pandemic struck.}]R
b. RORD 3.2
[{Becauseproduction doubled,}]T
TSynt
[{the stock market went up}]„ [{when war broke out. }L„
RSynt SPSynt
c. RORD 3.3
{[The stock market [{although war broke out}]SP went up.]R }
Synt Synt
RORD 4: Placement of SubCrR
CRSynt
A circumstantial SubC that is included in an RSynt Them-area is placed
after its SuperC, which includes the corresponding TSynt.
\
SuperC_,T-cireum^SubCcR ^ SuperC +...+ SubC
T_R
(25) a. ({[The shares]T [went up the next year,}q
when war broke out.}^]R
b. (^){[McGuire]T [bought theyshares
1 Synt
since he had the opportunity,}^ ({when war broke out.}@]R
5.2.4. Local clause-ordering rules for syntactically non-autonomous subordinate clauses The placement of a syntactically non-autonomous SubC is determined by the type of the surface-syntactic relation that subordinates it to a lexeme in the SuperC - not necessarily the Main Verb, as indicated above. Twelve SSynt-Rels can link the SuperC and its SubC:
- six surface-actantial SSyntRels - that is, subjectival, pseudo-subjectival1, direct-objectival, direct-speech-objectival,2 indirect-objectival and oblique-objectival SSyntRels;
1 The pseudo-subjectival relation was called "quasi-subjectival" in previous publications. The Pseudo-Subject is the element of the clause that expresses the DSynt-actant I in a clause with the dummy Subject (it in English; for instance, It is— [sfrange]-pseudo-subj^ that John should say so, where the subordinate clause that John should say so constitutes a Pseudo-Subject).
2 Direct Speech
Direct Speech [DS] and Direct Speech Introductor [DSI], or reporting clause, form two different syntactic constructions (this distinction was stated in [1, p. 1022-1023]): DS is the syntactic dependent or the syntactic governor.
- copular SSyntRel; ¡
- comparative SSyntRel;
- relative SSyntRel; |
- circumstantial SSyntRel;
- parenthetical SSyntRel;
- quasi-parenthetical SSyntRel (= subordinates a Direct-Speech Introductor). To order the corresponding clauses rules 5-15 are needed. (Rules of this
type for linearization of lexemes within English clauses can be found in [16].)
RORD 5: Placement of a subjectival SubC
A subjectival SubC is placed before its syntactic governor and all other dependents of the latter.
/ \
L-subjectival^SubC ^ SubC +...+ L*
(26) a. ({That phlogiston theory was flawed
({finally becameL clear in the 1830s.}(
The DSI is the SuperC, and the DS is its SubC.
The DS is a syntactic dependent of the Main Verb of its DSI; it is subordinated to this MV by the direct-speech-objectival surface-syntactic relation. Note that the DS cannot be considered as a genuine direct object of the reporting clause, since there is a semantic contrast of the type John whispered three words [direct object] vs. John whispered, "Three words" [direct-speech object]; for more, see [7, p. 218-220].
Communicatively, the DS is either the RSynt or a part of the RSynt of the whole sentence. The DSI is either the TSynt or consists of the TSynt and a part of the RSynt. The only linear position available for the DS is sifter the DSI:
(i) a. Q: What did McGuire say?
[McGuire said, ]t
["The biotechnology firms are setting up their own competitors."]R b. Q: What about McGuire? Synt
[McGuire]T
[took the foorlmd said, "The biotechnology firms are setting up their own competitors."]R
The DSI is the SubC, and the DS is its SuperC. Synt
The DSI is syntactically an adverbial, similar to a parenthetical, that depends on the DS by the quasi-parenthetical surface-syntactic relation: it is not a stand-alone phrase, it is omissible, carries a parenthetical prosody, and its linear position with respect to the DS is relatively free. However, it is not a genuine parenthetical: it cannot be replaced with a synonymous clause introduced by as (* "I will not go there!", as John said) and it is not repeatable with the same governor - unlike "normal" parentheticals.
Communicatively, the DSI is necessarily an SPSynt:
(ii) Q: What about the biotechnology firms?
a. ["The biotechnology firms]T [are setting up their own competitors,"]„
T Synt RSynt
[said McGuire, president of the Biotechnology Association.]SP
b. ["The biotechnology firms,]t
[said McGuire, president of thSyen tBiotechnology Association,]SP
[are setting up their own competitors."]R
(26) b. ({That John insisted}( ({he had not been there}(
({amazedL everybody.}( c. ({What John had bought lastyear}(
({later turnedLout to be valuable.}(
A SuperC can include its SubC as a necessary clausal element, like what we see in (26).
In (26b), RORD5 correctly positions clause^ the placement of clause2 is carried out by rule 7.1, while the correct mutual arrangement of clause2 and clause3 is ensured by rule 20 (non-projectivity).
RORD 6: Placement of a pseudo-subjectival SubC
I A pseudo-subjectival SubC is placed after its syntactic governor and all other dependents of the latter.
/ \
L-pseudo-subjectival^SubC ^ L* | +...+ SubC
(27) a. It amazedLus that John left early.
b. (i) ({It isLa shame}( ({that you speak Lushootseed so poorly,}(
({if we consider how much effort you've put into it.}( (ii) ({It isLa shame,}(
({if we consider how much effort you've put into it,}( ({that you speak Lushootseed so poorly.}(
c. ({ It isL strange}( ({how empty these paintings look now.}(
RORD 7: Placement of a direct-objectival SubC
|| A direct-objectival SubC is placed after its syntactic governor.
/ \
L-direct-objectival^SubC ^ L +...+ SubC1
(28) a. {McGuire saidL}
{(that) the biotechnology firms are setting up their own competitors.}
b. {McGuire knowsL exactly {where this data can be found} from his agents in Mexico.}
c. {McGuire knowsL,} {although he does not say so,} {where this data can be found.}
d. {John marriedL} {you cannot even start guessing whom.}
1 This rule does not cover such cases where a Focalized (rhematic or thematic) direct-objectival SubC precedes its syntactic governor, e.g., Where the advantage lay, he could not have said. Additional rules are needed to determine the mutual disposition of a clausal DirO and other actants of L.
RORD 8: Placement of a direct-speech SubC
A direct-speech-objectival SubC is placed after its syntactic governor and all other dependents of the latter (i.e., after the Direct-Speech Introductor).
/\ / \
L-direct-speech-objectival^-SubC ■ L* || + SubC
(29) {McGuire, the president of the Gruman Company, saidL, addressing the press:}
{"The biotechnology firms are setting up their own competitors."}
A sentence where the Direct Speech precedes the Direct-Speech Introductor, such as "The biotechnology firms are setting up their own competitors," said McGuire, manifests a different syntactic structure: see Footnote 2, p. 42-43.
RORD 9: Placement of an indirect-/oblique-objectival SubC
II An indirect-/oblique-objectival SubC is placed after its syntactic governor.
L-indirect-/oblique-objectival^SubC ■ L +...+ SubC
Indirect-/oblique-objectival SubCs are of two basic types: THAT-introduced SubCs and headless relatives.
(30) a. {We are sorryL} {(that) we have to leave early.}
b. {McGuire weighsL in} {on what is wrong with our school.} RORD 10: Placement of a copular-complement SubC II A copular-complement SubC is placed after its syntactic governor.
L-copular^SubC ■ L +...+ SubC
(31) {Kremliris hope isL,} {as the author believes,}
{that Ukraine will implode under weight of its economic problems.}
RORD 11: Placement of a comparative SubC
A comparative SubC is placed after its SuperC (which contains the lexeme L - the syntactic governor of the SubC).
/ \ L-comparative^SubC ■ SuperC(L) | +...+ SubC The right-hand side of this rule needs SuperC(L) rather than L, because of its SSyntS; otherwise, the SubC would be incorrectly placed after L.
(32) a. {The service is lessL ^ efficient now} {than it was 10years ago.} b. {The service is asL ^ efficient now} {as it was 10years ago.}
A special case of the comparative construction is the construction with so
or such:
(33) a. {I am soL ^ tired} {that my eyelids are closing on their own.}
b. {He is suchL^[an]-idiot}
{thatyou never know {what is to be expected from him}.}
RORD 12: Placement of a relative SubC
In a general case, a relative SubC is placed after its syntactic governor L and all other syntactic dependents of the latter; in a special case (according to the conditions sketched below), it is placed immediately after the whole SuperC.
Conditions C
' 1) length(SubC) > length(V*FIN); . 2) not L^subj-MV(trans)FIN-dir-obj^L'
L-relative^SubC & 1) L* +...+ SubC | not C
& 2) SuperC(L) || + SubC | C
Conditions that determine the possibility or the necessity of a "separated" relative SubC are formulated here in quite an approximate way - just to stake out the problem. Rule 12.2 describes a case of allowed non-projectivity (rule 20): the arrow of the relative dependency covers the syntactic head of the SuperC.
The prosody in a relative clause depends on what it is relative to.
(34) a. RORD 12.1
The sharesL{that John had bought a year before} went up.
b. RORD 12.2 (for so-called separated relative clauses)
(i) {A systemLemerged} {that was free of these drawbacks.}
(ii) {Several topics are discussed there}
{that reflect the support of institutional governance.}
Sentences (34b-i/ii) illustrate the relevance of the heaviness (= length) of the relative SubC.
(34) b. (iii) *wo{The systemL made serious mistakes,}
{that was supposed to be free of these drawbacks.}
Sentence (34b-iii) illustrates the impossiblity of a separated relative clause following a transitive verb with its DirO.
RORD 13: Placement of a circumstantial wH-SubC that depends on a finite verb
SubC = ¥-^wH-rel-L'(wh) (the SubC consists of a phrase ¥ depending on a wH-word)
II A circumstantial SubC of said type is placed after its syntactic governor.
L(V)FIN-circum^-SubC ■ L(V)FIN +...+ SubC
This rule deals with a particular type of SubC: a phraseologized expression consisting of a WH-word on which depends - by a special wH-relative surface-syntactic relation - a finite verb (from a limited set) with its subject (from an open-ended set) and perhaps other dependents: John lives nobody cares where; John left your mother wouldn't guess why.
(35) {John spendsL every night {God knows where} with his pals.}
RORD 14: Placement of a circumstantial SubC that does not depend on a finite verb
A circumstantial SubC that does not depend on a finite verb is placed immediately after its syntactic governor and all its other dependents. Lnon(v)FiN-circum^SubC ^ L* + SubC
(36) {John, the invincible jester, half an invalidL, {for his heart always gave him trouble,}
would be the last to agree to stop.}
RORD 15: Placement of a parenthetical/quasi-parenthetical SubC
A parenthetical/quasi-parenthetical SubC is placed before, after or inside its SuperC (which contains the lexeme L - the syntactic governor of the SubC)1.
L-parenthetical/quasi-parenthetical^SubC ^
if [MV(SubC) is a communication/opinion verb and topnode(SuperC) 4 CONJ], then topnode(SubC) = AS ^ 2) SuperC(L) +...+ SubC
^ 3) SuperC(L)- +.+ SubC +.+ -SuperC(L) | insert(parenth, clause) Rule 15.3 ensures the insertion of a parenthetical SubC into its SuperC. Since the conditions for the placement of a parenthetical inside its SuperC are rather complex, they are preferably united in a separate operator insert (pa-renth, clause); the elaboration of this operator is a challenging task, which cannot be tackled here.
(37) a. RORD 15.1
(i) {4s McGuire reports,} {the shares wentL up.}
(ii) {When, {(as) McGuire reports,} the shares wentL up,} {John sold them.}
^ 1) SubC +...+ SuperC(L)
1 Although communicatively a parenthetical or quasi-parenthetical SubC is a SPSynt, rule 15 cannot be subsumed under rule 3 (by considering the parentheticals as circumstantials), since the parentheticals feature specific properties - among other things, a parenthetical, unlike a circumstantial, is not introduced by a semantically full conjunction but requires the conditions for the introduction of the empty conjunction AS (cf. the condition in rule 15.1).
(37) a. (iii) *woMcGuire reports (Reports McGuire), the shares wentL up.
Sentence (37a-iii) is ungrammatical, since it contradicts the condition.
The sentence McGuire reports the shares went up is possible - as the realization of a syntactic structure in which the SubC is completive - that is, the direct object of the verb report with the complementizer that omitted. It is prosodically different from (37a-iii).
(37) a. (iv) {These stories are not out of date}
{since, {I am told,} they areL still widely read.}
Here the parenthetical clause I am told can be without as since its SuperC is introduced by a conjunction (= since).
(37) b. RORD 15.2
(i) {The shares wentL up,}
{as McGuire reports (as reports McGuire).}
(ii) { "I don'tL, know"} {McGuire said (said McGuire).} c. RORD 15.3
(i) {Fortunately, {McGuire reported on March 15th,}
the shares areL going up.}
(ii) {On March 15th, {(as) McGuire reports,} the shares wentL up.}
(iii) { "On March 15th, {reports McGuire,} I sawL the shares go up."}
5.3. Quasi-local clause-ordering rules
Local clause-ordering rules state correspondences between tree-like structures and their linear embodiments; more precisely, they determine the placement of a SubC with respect to its SuperC or to a lexical unit in the SuperC. In contrast, quasi-local and global clause-ordering rules are filters: they ban some ungrammatical and/or questionable arrangements.
Quasi-local clause-ordering rules take care of the mutual arrangement of two co-subordinated contiguous Cs - that is, two Co-SubCs that are to be positioned on the same side of their SuperC.1
Being a filter, a quasi-local clause-ordering rule specifies the impossible mutual disposition of two Co-SubCs. An arrangement not precluded by any quasi-local rule is considered correct (at this stage: it still can be rejected by global rules, which consider the syntactic structure of the whole sentence).
1 Two Co-SubCs can have two different governors within the same SuperC; thus, one of the Co-SubCs may depend on the MV of the SuperC, while the other depends on a different lexical element.
Two cases are distinguished: either both contiguous Co-SubCs are syntactically autonomous or at least one of them is non-autonomous.
From the viewpoint of quasi-local rules, two contiguous syntactically autonomous circumstantial Co-SubCs can appear in any mutual order.
(38) a. ({[McGuire]TSynt [bought the shares,}( ({as soon as he had the opportunity,}( ({when war broke out.}(]RSynt
b. ({[McGuire]TSynt [bought the shares,}( ({when war broke out,}( ({as soon as he had the opportunity.}(]RSynt
If at least one of two contiguous Co-SubCs is non-autonomous, there can be constraints on their linear placement as a function of their syntactic nature.
Up to this point, we found such constraints only for subjectival and relative SubCs.
RORD 16: One of the Co-SubCs is a subjectival SubC
A subjectival SubC cannot be separated from its SuperC by a Co-SubC which is not a parenthetical.
SubC'—r- SuperC-subjectival^SubC ^
^ *woSubC + SubC' + SuperC | r 4 parenthetical
- subj -1
(39) a. ({That John left early,}( ({as could be expected,}( — ({amazed
everybody.}(
b. *wo({That John left early,}(
({since we were unprepared,}( — ({amazed everybody.}(
RORD 17: One of the Co-SubCs is a relative SubC
|| A relative SubC cannot be separated from its governor L by a Co-SubC.
SubC'—r-SuperC(L)-relative^SubC ^
^ *woSuperC(L) + SubC' + SubC ^ *woSuperC-(L) + SubC' + SubC + -SuperC
(40) *wo{The shares, [{when war broke out, }SubC'
{which John had bought a year ago} went up.}
5.4. Global clause-ordering rules
Four types of English global clause-ordering rules were sketched out in [1, p. 1037-1044]:
- avoid more than double left-branching of clauses ("left branching" is linearly positioning a SubC before its SuperC);
- avoid repeated embedding of clauses;
- avoid non-projective ordering of clauses;
- avoid ordering of clauses leading to syntactic ambiguity.
We can turn these indications into global rules 18-21, presented here informally.
RORD 18: Avoid more than double left branching of clauses SubC1^SubC2^SubC3^SuperC ^ *woSubC1 + SubC2 + SubC3 + SuperC (41) a. ({When, ({if the war breaks out,}( the shares go up,}( ({we could sell them.}(
This sentence is somewhat clumsy, but correct because it manifests only double left branching. Note that, according to the Clause-Border Marker Condition (5.2.1), SubCj is considered to precede SubC2; the same holds for (41b).
(41) b. *wo({ When, ({if ({although everybody tried to prevent it,}( the war breaks out,}( the shares go up,}( ({we could sell them.}(
A triple left branching is disallowed; cf. the correct sentence (41c), with only double left branching (since SubC1 is positioned to the right of its governor, SubC2): 1
(41) c. ({When, ({if the war breaks out,}(
({although everybody tried to prevent it,}( the shares go up,}( ({we could sell them.}(
RORD 19: Avoid double embedding of clauses
SubCj ^SubC2^SuperC ^ *woSuperC- + SubC2- + SubCj + -SubC2 + -SuperC
(42) *wo{The shares, ({when the war, ({after the treaty had been rejected}(
broke out,}( went up.}
RORD 20: Avoid arrangements leading to disallowed non-projectivity
For instance, unlike sentence (43a), sentence (43b) is bad because of its non-projectivity:
(43) a. ({McGuire said}( ({that, ({while S.E.C. regulates filing,} (
the law determines the status of clients.} (
-J?
b. *wo({While S.E.C. regulatesfiling,}( ({McGuire said}( ^ ({that the law determines the status of clients.} (
Sentence (43b) can be grammatical - as the expression of a different SSynt-structure, where the wHiLE-clause depends on say rather than on determine.
An interesting case of disallowed non-projectivity is the impossibility of inserting a SuperC in its SubC:
I ^ "I I
(44) *wo({ When the war, ({the shares went up,}( broke out.}(
However, as is well known, several situations of non-projectivity are acceptable in English syntax. Here we indicate only four of them (additional constraints are not mentioned). Four allowed cases of non-projectivity
1) L(N)(Ci) + L(V, mtrans)(Cl) + C2 such thirt L(V, mtrans)-SubjeCtival^L(N)-relative^C2
I ^
Recently a book—appeared which is a dialogue between Pope Francis and
a Jewish rabbi.
2) L(compar)(C1) + L(A)(Cl) + C2 such that
L(A)-restrictive^L(compar)-comparative^ THAN
The service is lessL—efficiently conducted now than it was 10 years ago.
3) so(C1) + L(a)(C1) + C2 such that L(A)-restrictive^SO-comparative^ THAT
I ^
I am so — tired that my eyelids are closing on their own.
4) C1 + C2 + C3 such that C3-circumstantial^C1, C2-dir-objectival^C3 and MV(C2) is a mental/communication verb
Although
these books were published 20 years ago, John thinks-
that they are not dated.
RORD 21: Avoid arrangements leading to syntactic ambiguity
?woC1 + C2 + C3 such that C1^C3 and C2^C3 are potentially possible.
(45) a. ?wo({You believe}( ({that life will love you back}( ({if you love life.}(
This sentence is ambiguous in the written text (orally, prosody allows for disambiguation): C3 can depend either on C1 or C2, both interpretations making perfect sense. Therefore, the following arrangements are preferable: (45) b. ({If you love life,}(—({you believe}( ({that life will love you back.}(
c. ({You believe,}( ^ ({if you love life,}( ({that life will love you back.}(
d. ({You believe}( ({that, ({if you love life,}( ^ ({life will love you back.}(
To these four global rules, at least one other has to be added, taking into account the Synt-Them-structure.
RORD 22: Avoid arrangements in which an RSynt linearly divides a TSynt (of the same level) or vice versa *woCj + C2+ C3 such that Cj and C3 belong to TSynt/RSynt, and C2 belongs to
RSyn/TSynt.
(46) a. (i) [({[The shares]T(T)synt [went up,]R(T) t}(
[({although funds were not available,}(]SP(T)S (]Ts [({when the war broke out.}(]Rs t (ii) *wo[({[The shares]T(T)synt [went up,]R(T)synt}( [({when the war broke out,}(]Rs t [({although funds were not available.}(]SP(t)s (]Ts t
Sentence (46a-ii) is grammatical as an expression of a different Them-struc-ture, clausEj being TSynt, clause2 - RSynt, and clause3 - sPSynt.
(46) b. (i) [({That John insisted}( ({he had not been there}(]T
[({amazedL everybody.}(]Rsynt (ii) *wo[({That John insisted}( [({amazedL everybody}t ({he had not been there.}(]T^ ^
Unlike TSynt and RSynt, an spSynt can interrupt the expressions of other Them-areas:
(47) {[The service, [{although the government denies it,}]SP
SPSynt
is less efficient now]Ts ^ [{than it was 10years ago.}]Rs ^
6. Applying clause-ordering rules: an illustration
To illustrate our rules, we will examine two rather complex sentences. These illustrations present:
- the sentence S to be synthesized, with the division into clauses and Synt-Them-areas specified;
- a schema of syntactic links between clauses of S, where each clause is supplied with an explicit indication of its position with respect to Synt-Them-areas and each link, with the number of the rule applied to it;
- the SSyntR of S, with a partial Synt-ThemS (without showing the internal Them-division, where it is immaterial);
- the list of rules relevant for constructing S;
- the final string of clauses that compose S. Example 1
(48) McGuire said}(]T
[({that, [({while [the S.E.C.]T(SP(R)^ [regulates}( ({who fihs,}@]R(Sp(R))synt]Sp(R)synt [they"law]T(R)Synt [tells them}(
({when they can do s°.}©]R(R)Synt]RSynt
CLAUSE iT
0
1
dir-obj, Rule 7.1
o CLAUSE i,
' v
circum, Rule3.1 dir-obj. Rule 7.1
CLAUSE ::sp o o
u. ' CLAUSE s
dir-obj, Rule 7.1 +
O
CLAUSE 4
Fig. 6. The schema of syntactic links between clauses of sentence (48) Fig. (7) - the SSyntR of sentence (48) - is given on the next page.
The application of clause-ordering rules in sentence (48)
cLAusEj-dir-obj—clause2: RORD 7.1 gives the arrangement clausEj + clause2. cLAusE2-circum—clause3: RORD 3.1 gives the arrangement clause3 + + clause2, which leads to a conflict: should clause3 be positioned before clause^ which is followed by clause2 or between the two - that is, immediately before clause2? The answer comes from a global clause-ordering rule, RORD 20, which forbids arrangements leading to non-projectivity. In this case, the interposition of clause1 results in non-allowed non-projectivity:
*woclause3 + clause1+ clause2.
The resulting correct arrangement is clausEj + clause3 + clause^.
Rord3.2 and RORD 3.3 are also applicable, producing two more variants of clause ordering.
clause3-dir-obj —^clause4: Rord 7.1 gives clause3 + clause4; clause4 follows clause3 and precedes clause2. The incorrect arrangement *woclause3 + + cLAuSE2 + cLAuSE4 is banned by the same global rule as above: RORD
1 Taking into account the clause-Border Marker condition, cLAuSE3 is placed after the conjunction that, which is part of cLAuSE2.
Языкознание
Fig. 7. A simplified surface-syntactic representation of sentence (48)
20; the correct arrangement is clause1 + clause3 + clause4 + clause2. CLAUSE2-dir-obj^clause5: Rord 7.1 gives clause2 + clause5.
The final result is clause1 + clause3 + clause4 + clause2 + clause5. Example 2
(49) [[q{Though twenty years have passed}0]r(spi)s t
since these stories were written, }@]t(sp1)S t]sp1s t [@{I think}@]sp2synt ©{tthey]tsynt [are not out of date,]rsynt}® ^Wme [@{i am ^ }@]sp(sp3)synt [they]t(sp3)synt [have been required reading for the Department's personnel.] r(sp3)S t}@]sp3s t
CLAUSE 3 O
dir-obj, Rule 7.1
O CLAUSE4dR,=t
circum, Rule 3.1 circum, Rule 3.2
CLAUSE] SP1,
-Fl o O CLAUSE 5
R(SPl) | |
circum, Rule 2.2 parenth, Rule 15.1
I \
O O
CLAUSE 2 T(SPI) CLAUSE6 SPISP3) Fig. 8. The schema of syntactic links between clauses of sentence (49)
Fig. (9) - the SSyntR of sentence (49) - is given on the next page. The Application of Clause-Ordering Rules in Sentence (49)
CLAUSE1-rircum^clause2: RORD 2.2 gives clause1 + clause2. (RORD 2.1 is also applicable and produces another possible variant: clause2 + clause1; recall that if a Sub-clause Ca has to precede its Super-clause Cp introduced by a conjunction, Ca is linearly positioned immediately after this conjunction: 5.2.1, Clause-Border Marker Condition.)
CLAUSE4-rirCUm^clauser RORD 3.1 gives clause1 + CLAUSE4. (RORD 3.2
and RORD 3.3 are also applicable.) clause4 can be positioned either immediately after clause1 (thus preceding clause2), or after clause2. The arrangement *clause1 + clause4 + clause2 is precluded by global rule RORD 20 (forbidden non-projectivity); the remaining correct string is clause1 + clause2 + + clause4.
CLAUSE3-dir-obj^clause4: RORD 7.1 gives clause3 + clause4. As a result, we obtain three alternative strings:
1) either clause3 + clause1 + clause2 + clause4;
2) or CLAUSE1 + CLAUSE3 + CLAUSE2 + clause4;
3) or CLAUSE1 + CLAUSE2 + CLAUSE3 + clause4.
Arrangement 2 is precluded by global rule Rord 20 (non-projectivity); both arrangements 1 and 3 are possible.
cLAusE4-circum—clause5: Rord 3.2 gives clause4 + clause5. (Rord 3.1 and Rord3.3 are also applicable.) cLAusE5-parenth—clause6: Rord 15.1 gives clause6 + clause5. (rord15.2 is also applicable.)
Two final orderings are as follows (the second one coinciding with sentence (49)):
clause3 + clause1 + clause2 + clause4 + clause6 + clause5 clause1 + clause2 + clause3 + clause4 + clause6 + clause5
Fig. 9. A simplified surface-syntactic representation of sentence (49)
As one can see, the placement of circumstantial and parenthetical specifier i clauses allows for considerable
variance, some of it (more or less) free, and ma. some under the control of several additional factors. Thus, two heavy specifiers | are not welcome at the same edge of the sentence; a contextually-bound specifier is preferred at the beginning; a very light specifier is dispreferred at the end of the sentence; such phraseologized specifiers as, for instance, what is even more surprising, tend to be anteposed; etc. Further rules are required to take care of these cases.
7. Conclusions
Rounding up our exposition, we would like to attract the reader's attention to four important linguistic facts that our study has shed some light upon.
1. From the viewpoint of their linear placement, subordinate clauses are of two types: syntactically autonomous clauses, which are placed with respect to their superordinate clause as a whole; and syntactically non-autonomous clauses, placed with respect to a lexeme in their superordinate clause. A subordinate clause of the first type is a circumstantial clause, and it is introduced by a semantically full conjunction; its placement is determined by the Synt-Thematic structure of the sentence. For the placement of the second-type subordinate clauses the thematic structure of the sentence is much less relevant: syntax takes care of it.
2. As is well known, the syntactic and communicative organization of a sentence do not necessarily coincide; they need not even be isomorphic. But their divergence reaches its peak at the level of complex sentence: the borders of clauses frequently do not coincide with the borders of communicative - in our case, thematic - areas. A clause inside a complex sentence can constitute a whole thematic area, be a part of a thematic area, contain a thematic area, or be distributed between two thematic areas. This phenomenon essentially affects the ordering of simple clauses.
3. Along with the Theme and the Rheme, the Specifier proved to be a relevant Thematic notion.
4. Rules for ordering simple clauses within complex sentences are organized in three groups: local, quasi-local, and global, which corresponds to the organization of the rules for word order within simple clauses. This fact seems to indicate that the order of various components of the sentence is controlled by factors of "triple" nature, identical for wordforms, phrases and clauses.
As for the perspectives of further studies, two trends are obvious: - On the one hand, developing and sharpening the description of each particular construction, covered by our rules. The linguistic phenomena that
need to be accounted for include, in the first place, semantic considerations
(different types of Circumstantials), other communicative categories (for instance, Givenness and Focalization), and heaviness of clauses.
- On the other hand, looking for better generalizations, trying to find a common shape for rules that deal with the ordering of words and phrases within clauses and for rules that work with clauses within complex sentences.
References
1. A Comprehensive Grammar of the English Language / Quirk R. e.a. London/ NY, 1991.
2. Bohnet B., Burga A., Wanner L. Towards the annotation of Penn Tree Bank with information structure // Proceedings of the Sixth International Joint Conference on Natural Language Processing. Nagoya, 2013. Р. 1250-1256.
3. Contrasts and Positions in Information Structure / Kucerova I., Neeleman A. (eds.). Cambridge, 2012.
4. Diessel H. Iconicity of sequence: A corpus-based analysis of the positioning of temporal adverbial clauses in English // Cognitive Linguistics. 2008. № 19 (3). Р. 465-490.
5. Firbas J. Functional Sentence Perspective in Written and Spoken Communication. Cambridge, 1992.
6. Iordanskaja L. Structure communicative de l'énoncé dans la génération automatique du texte // TA-TAO: recherches de pointe et applications immédiates / Clas A., Bouillon P. (eds.). Beyrouth, 1994. Р. 275-286.
7. Iordanskaja L., Mel'cuk I. Establishing an inventory of surface-syntactic relations: valence-controlled surface-syntactic dependents of the verb in French // Dependency in Linguistic Description / Polguère A., Mel'cuk I. (eds). Amsterdam/Philadelphia, 2009. Р. 151-234.
8. Iordanskaja L., Mel'cuk I. Illocutive parenthetical verbs in Russian // Proceedings of the 5th International Conference on Meaning-Text Theory, Barcelona, September 8-9 2011. Barcelona, 2011. Р. 120-133.
9. Lee Ch.-B. The information status of English гу-clauses in natural discourse // Language Research. 2001. № 37 (3). Р. 483-505.
10. Mel'cuk I. Communicative Organization in Natural Language. The Semantic-Communicative Structure of Sentences. Amsterdam/Philadelphia, 2001.
11. Mel'cuk I. Dependency in Language // Dependency Linguistics / Gerdes K., Hajicova E., Wanner L. (eds.). Amsterdam/Philadelphia, 2014. Р. 1-32.
12. Mel'cuk I. Semantics: From Meaning to Text. Vol. 1. Amsterdam/Philadelphia, 2012.
13. Mel'cuk I. Semantics: From Meaning to Text. Vol. 2. Amsterdam/Philadelphia, 2013.
14. Mel'cuk I. Semantics: From Meaning to Text. Vol. 3. Amsterdam/Philadelphia, 2015.
15. Mel'cuk I. Word order in Russian // Слово и язык: Сб. ст. к восьмидесятилетию академика Ю.Д. Апресяна / Под ред. И. Богуславского, Л. Иомдина, Л. Крысина. M., 2011. С. 499-525.
16. Mel'cuk I., Pertsov N. Surface Syntax of English. A Formal Model within Meaning-Text Framework. Amsterdam/Philadelphia, 1987.
к.
17. Polguère A. Remarques sur les réseaux sémantiques Sens ^ Texte // Le mot, les S mots, les bons mots / Clas A. (ed.). Montréal, 1992. Р. 109-148. ^
18. Кручинина И.Н. Сложное предложение: Введение // Русская граматика / |= Под ред. Н.Ю. Шведовой. М., 1980. Т. II. С. 461-467. =
19. Падучева Е.В. O порождении сложного предложения из простых // Исследования по математической лингвистике, математической логике и информационным языкам / Под ред. Д.А. Бочвара, Ю.А. Шрейдера. M., 1972. С. 59-78.