Friday, June 20, 2025

Topics in Koa meta-analysis

Back in 2023 I realized with excitement that I was in the position to calculate some statistics on the origin of the Koa lexicon. Since then some additional forensic techniques have developed, and I can now lay out a significantly clearer and more detailed picture, including previously unexamined data around patterns and subjects of inspiration and productivity. I have to say I'm endlessly fascinated by this kind of intense meta navel gazing, and the kind of autobiographical insight it can provide (assuming I can figure out how to interpret the data).

In the domain of etymology, first of all, I now have data on all but 50 of my 1066 roots, and year-of-creation data for all but one of them. Below are the percentages of my predicate roots by source; origins with fewer than 10 tokens apiece (5% of the total) have been grouped into "Other," representing a wide range of both natural (Basque, Latin, Malay, Nahuatl, Quechua, Swahili, Turkish and many others) and artificial languages (Doraja, Esperanto, Quenya, Seadi, etc.):


Some notes on the categories...

Random words (30%) were created with the help of my vocab tracker and random-word selector, which ensures no unintended homophony among roots, and provides random suggestions for needed words weighted by the preferability of their structure. I should clarify that I never make these choices without considerable thought and review: my program makes a suggestion, but the aesthetics have to feel right before I'll accept it.


Created words (7%), on the other hand, were inspired by the Muse in some way, and created specifically for their intended use. These include mina "woman" and kane "man," lai "come back," tasi "repeat," tai "be, exist," mua "die," itu "use," kuhu "owl," etc.

Internal coinages (7%) were formed via clusters of existing vocabulary, mainly (but not exclusively) particles:

to "that [adjectival]" + a "[indefinite]" > toa "that [pronominal]"
me "with" + no "without" > meno "regardless, anyway"
ke "which" + ne "in, on, at" > kene "location"
ai "[question marker]" + saa "receive, be allowed" > aisa "pardon, excuse me, may I"

Slavic words (2%) comprise loans from Polish, Russian, Bosnian and Macedonian. There were actually enough Polish words to constitute a category in their own right, but I didn't want the remaining Slavic loans to be hidden in "Other"...

Polynesian words (11%) include Hawaiian, Samoan, Tahitian, Tongan and Maori. I think I originally expected this category to be much larger given Koa's nearly Polynesian phonology, but it's turned out that Finnish words tend to be more amenable to the constraints of Koa root structure.

Finnish loans (26%) are the largest non-random category, providing more than a quarter of Koa's core vocabulary. When looking to assign a new root, I almost always check Finnish first -- if it has a suitable word, or one I can easily modify to be suitable, it's generally my first choice. I'm positive that this is a reflection of my own aesthetic biases, but it certainly also helps that the phoneme inventory and syllabic constraints of Finnish are not much more complicated than Koa's, while providing more variety and salience than Polynesian for this purpose.

Family & Friends (1%), lastly, include coinages for Koa by other people in my life! Two former partners are represented here (anu "water" from Amelia, culi "move" and many others from Olga), friends (soto "meditate" from Jonathan), and of course my daughter (lapa "safe"). I actually really love these little lasting artifacts of my loved ones; I wish I'd sought more of these over the years! I've been considering giving other friends and family members the phonology and commissioning creations for words that have been problematic heretofore (like "rice," "springtime," "spoon" and "unenvious").

In addition to the origins of Koa words, it's also interesting to look at the timing of their creation, which has been anything but linear.


There were two significant spikes, centered on 2011 and 2023, one with a lead-in from the previous year, the other with a coda into the following. In all, these two spikes and their entourage are responsible for 876 roots, 82% of the total. It appears that word-creation is something of a separate "project" for me: to be taken on when necessary, but otherwise proceeding only at a trickle. Let's see what happens if we compare the rate of word creation with blog post creation...


Blog posts -- which tend to occur alongside significant structural or theoretical work on the language -- have spikes too, though apparently following a different drummer than word creation. For one thing, interestingly, spikes tend to skip years: unlike with words, there are no two consecutive years with blog posts being written at a higher than average rate. Also, though there may not be enough data to justify attributing causation, it would appear that structural work may drive word creation: in both cases of a significant word-creation spike, a period of structural work preceded it, sometimes for several years.

Thinking about this subjectively, I suppose it ought not to be surprising. Assigning words is probably my least favorite area of conlanging: (1) the stakes feel high, as these choices end up determining much of the aesthetic character of the language: where I make choices I dislike, these can affect my interest in the whole language; and (2) it's almost entirely a subjective/creative/artistic process, something that I can do but which is much more difficult for me to access than, say, problem-solving. (This is one reason I enjoy the company of ADHD folks, for whom eliciting that kind of creative vision tends, seemingly, to be rather effortless.)

In other words, then, what seems to happen is that my main conlanging inspiration is structural or philosophical -- phonology, morphology, syntax, pragmatics, even sociolinguistics -- and I turn to lexis basically to support my ability to continue developing those areas, or when needed in order to produce or translate a certain kind of text. Fascinating! I really had no conscious sense of this.

I did, however, have a sense that there were certain times of year when I tended to be more active in Koa development than others, and finally found a way to explore this. Again taking blog publishing rates as a convenient approximate metric on activity level, we can see that there are interesting patterns associated with months of the year:


In particular, a post is about twice as likely to get written between November and April (69%) than it is between May and October, with February by far the most productive month. On the other end of the spectrum, in almost 26 years there have only been four posts written in the month of August. My immediate assumption was that this differential was likely to be weather-related, so I did some additional research...

Using Weather Underground's history features, I put together a grid showing the average monthly temperature for each month in which posts were published, localized to wherever I happened to be at the time.


This bears out my suspicion in that again a post is about twice as likely to be published when the average temperature is below 60°F, but it's hard to get more than that from these data because the frequency of posts gets entangled with the frequency of the temperature ranges themselves. To counteract this, I redid the chart showing post frequency per temperature as a percentage of a baseline in which temperature did not affect productivity. Here the results are pretty striking:


Below a monthly average temperature of 60°F, posting activity is approximately at or significantly above baseline, reaching a magnificent high of 149% between 50°F and 54°F. On the other hand, as soon as the temperature rises to 60°F the productivity rate compared to baseline begins to fall rapidly, as low as 48% when the average temperature is at or above 70°F.

The interesting conclusion is that I clearly feel more inspired or productive when the weather is cooler: approximately the range of temperatures in which I tend to wear a hat and socks indoors, and keep a space heater running in my room. When the temperature is warmer I tend to spend more time outside doing other activities, for one thing, but I think that beyond that my mindset changes somewhat: it's not the way I spend my time, it's the way I'm inspired to spend my time.

Another thing I thought was interesting is that it seems I'm significantly more likely to feel like working on Koa when the temperature is cool but not too cold. This surprised me: I had assumed there was just a "warm weather" mode and a "cold weather" mode, but it appears that outdoor temperatures below a certain point diminish inspiration or motivation in some way.

In the end I worry whether all this may just represent cheerful but ultimately frivolous data-crunching...but given that I've given so much time, thought and effort to this project over so many years of my life, I find it rather wonderful and meaningful to get to take a look at the character of the work itself. When I feel inspired, that fire tends to seize me and I don't really notice what I'm doing; it turns out that there is some kind of internal form and science, not just to the product but to the inspiration itself. It feels kind of moving, somehow, and makes me want to take more pride, and have more faith, in my particular curious artistic process.

Thursday, June 19, 2025

Social Niceties IV: Politeness & Obscenity

At last, the fourth and final installment of this series on social-focused language in Koa: ways of indicating respect or politeness on the one hand, and obscenity or disparagement on the other. The foundation for both types of language is long-standing, and some of the constructions discussed here have existed for a long time, but in many cases there was a lot of development and extrapolation needed.

Here is a start to the subject, with the caveat that real patterns of usage will need to be developed via...well, usage. This is an area for which need has arisen rather less in existing Koa texts, in which I've been primarily (A) journaling, or (B) talking to children!

11. Respect & Politeness

11.1 The honorific -a

The most basic morpheme to add formality, politeness or deference to an utterance is the honorific particle -a (-ha after -a in the stem), which has been mentioned many times previously in this series. By default, the honorific is suffixed to the verbal unit:

kea sa se-hálu-a?
what FOC 2SG-want-HON
"what would you like?"

ai se-me-táli-a?
QU 2SG-COM-offspring-HON
"do you happen to have any children?"

kéle-a sa se-noa?
what-HON FOC 2SG-name
"what might your name be?"

The last two translations are getting a bit heavy-handed in trying to convey a kind of formality that English doesn't really express consistently. Languages with T-V-type distinctions don't quite get at it either: kélea sa senoa? could sometimes translate into Spanish as ¿cómo se llama Usted? but not always. Honorifics can be inserted to denote extra politeness even in contexts where a Spanish speaker would not use Usted except in jest: with a romantic partner, for example.

It's important to keep emphasizing that these translations are attempts to get at the intended meaning, but miss the mark somewhat. In my idiolect, "sir" is quite formal; I would always address my boss with héia "hello" or kea sa semáia "how are you?" where I would never dream of calling him "sir" in English.

It is possible to amplify the implied politeness by adding -a to more than one constituent, though in all but the most formal of situations this may feel a little over-the-top:

kéle-a sa se-nóa-ha?
what-HON FOC 2SG-name-HON
"what might your name be, sir?" "might I have the honor of your name?"

11.2 Terms of address

For polite address, the most flexible is based on the root cano "honorable, admirable":

cánoa "sir/ma'am/miss"
cano, cánoka/cánoe "gentleman/lady"

The first expression, cánoa, is ungendered: an all-purpose form of direct address for anyone being shown overt respect: approximately English "sir," "ma'am" or "miss." In 3rd-person reference the root may also appear on its own, or with the gendered suffixes -ka or -e if desired: Ai secuteapu ti cáno(ka)? "Could you assist this gentleman?" In these cases the honorific is not usual: *cánokaha.

Where "sir" or "ma'am" is intended to denote serious deference or adherence to a hierarchy -- speaking to a superior officer in the military, for example -- we have an additional option using the root ula, literally "higher in a hierarchy." Again, the form with the honorific affix is used primarily in direct address, the other forms in the 3rd person:

úlaha "sir/ma'am"
úla, úlaka, úlae "superior, superior officer (m/f)"

Note that the opposite of ula, nala "lower in a hierarchy" may be used without disparagement to refer to, for example, one's direct reports: nálaka/nálae. However, note that -- outside of the military -- in direct address this term is likely to be taken as strange or offensive: Mitace, nálaha! "With all due respect, silence, underling!"

Speaking to or of one's boss or leader specifically, etu "main, chief, lead" is available: étua for direct address, etu/étuka/étue in the 3rd person. Etu without an honorific has a jocular sense, much like "boss" may sometimes appear in English.

Lastly, if the referent is a literal member of the nobility in a given society, terms based on the root ialo would be used: iáloa "my lord, my lady, noble born"; iáloka "gentleman, nobleman"; iáloe "lady, noblewoman."

11.3 Phrases of deference

In contexts of great social distance between speaker and addressee, a number of expressions use the roots vati "require" and kase "command":

sevátia? "What can I do for you?" lit. "You require?", short for Kea sa sevatiá? "What do you require?"

kásea? or sekásea? "What would you like, sir/ma'am?" lit. "(you) command?", short for Kea sa sekásea "What do you command?"

These phrases have much wider usage than the direct English translations might suggest. (Se)kásea? might be used by a server in a nice restaurant to mean "what would you like to order?", or as a formal "pardon me?" in a situation of receiving instructions from a superior.

Also relevant to these types of contexts is heti, literally "immediate," meaning "right away, got it, yessir": a statement of understanding/acceptance of direction.

12. Disparagement & Obscenity

12.1 The pejorative
-mu

Parallel with the honorific -a is the pejorative suffix -mu, indicating dislike, irritation, disrespect or general disparagement. When its scope is a whole utterance, like -a it tends to appear on the verb:

kea sa se-éte-mu?
what FOC 2SG-do-PEJ
"What the hell did you do?"

-mu is somewhat freer than -a, however, in that its scope may also apply specifically to the constituent to which it is affixed. In the above, this gives an additional possible reading of "What did you screw up?" Other examples:

nekea sa ka túsi-mu-ni?
where FOC DEF book-PEJ-1SG
"Where's my damn book?"

ka áuto-ni i súlu-mu
DEF car-1SG MAIN.CL collapse-PEJ
"my car fucking broke down"

ka áuto-mu-ni i sulu
DEF car-PEJ-1SG MAIN.CL collapse
"my lousy car broke down"

Note that the sense of ka áutoni i súlumu, with the pejorative suffix on the verb, is general upset about the situation; in ka áutomuni i sulu, on the other hand, the suffix on "car" identifies it as the specific object of disparagement.

Question words like kea "what," keka "who" and nekea "where" may themselves also take -mu for a sense of "what/who/where the hell":

kéa-mu sa se-ma-éte?
what-PEJ FOC 2SG-IMPF-do
"What the hell are you doing?"

12.2 Obscene words

Cursing and obscene invective are, alas, not yet particularly well developed in Koa, and are still rather tame. In approximate order of intensity, the ones I've identified so far are:

havu "nasty, dishonorable, shameful" -- the opposite of cano. In my idiolect, this might correspond in strength to words like "lousy," "bastard," "asshole," etc.
ceu "despised, damned"
peke "goddamn, fucking, piece-of-shit"

The above are all adjectival in core sense, but are commonly used as interjections as well:

havu "hell, bastard, asshole"
ceu "damn"
peke "fuck, shit"

Combinations with question words are also possible:

kea/keka ceu..., ke ceu... "what/who the hell..."
nekea peke... "where the fuck..."

12.3 Phrases of disparagement

The obscenities discussed in the previous section may be directed at another party either via the imperative particle vi, or in apposition with se "you":

vihavu, havu se "you bastard/asshole"
viceu, ceu se "damn you"
vipeke, peke se "fuck you"

The imperative forms may be strengthened with a following ablative directional marker o, roughly analogous to "off" in some English expressions:

viceu o "go to hell"
vipeke o "fuck off"

Note that o is stressed in such uses, and may be so spelled: vipeke ó, etc.

Other expressions of disapprobation, all of which become stronger/more vulgar with -mu:

mitace(mu) (o) "shut (the hell) up"
(vi)lahe(mu) o "go (the fuck) away"
láhe(mu)ni o "leave me (the fuck) alone"

Of course, obscenity in Koa is still in its infancy, and a skillful speaker will no doubt find many additional creative ways of expressing these kinds of emotions!

This concludes our catalog of the language currently at Koa's disposal for social, discourse and emotive purposes...but again, this is an open-ended and constantly evolving domain in any language, and will continue to develop as opportunities for use emerge. If I can recruit even one additional speaker someday, I'm sure the richness of the language in these areas will swell very quickly.

Thursday, June 12, 2025

Polynesian in the pronouns

I just had a crazy, but maybe kind of good, idea? I was rereading this post from 2010, in which I said:

20) And speaking of pronouns, I have a provisional decision. For a little while there I was doing singular/plural for all persons (ni/nu, se/so, ta/tu), but I don't think that's the way to go. Instead, I've kept nu for "we," possibly exclusive, on the grounds that it really does mean something different than "I," along with seni/senu for dual/plural inclusive. Also possible are ponu "we, all of us (exclusive)," poseni "y'all and me" and posenu "y'all and us." This then becomes the strategy for the other persons: "y'all" is pose, and "they" is pota.

 

What's uncertain is whether I really want to maintain an inclusive/exclusive distinction (my gut says probably not in an IAL, though seni is still a useful thing to have), and also what happens when these longer forms are used with verbs: i or no i?

 

Also, if there's seni, why not tani? A matter for further thought. Maybe I'm not as sure about the above after all.


Obviously many things changed (or failed to change) in the following 15 years. I retained nu "we," so "y'all" and tu "they" in use as the plural pronouns despite what I wrote above; seni "you and I," ponu "all of us," poso "all of you" and potu "all of them" have remained in the lexicon, but to my knowledge have never actually been used.

It just occurred to me...now that my tether to IALdom has begun somewhat to fray, what would it look like if all possible combinations of pronouns could be used as long forms? This would yield amazing, Polynesian-level specificity, or beyond:

1st Person Inclusive
seni "you and I"
senu "you and we"
soni "y'all and I"
sonu "y'all and we"

1st Person Exclusive
tani "he/she and I (but not you)"
tanu "he/she and we (but not you)"
tuni "they and I (but not you)"
tunu "they and we (but not you)"

2nd Person
tase "he/she and you"
taso "he/she and y'all"
tuse "they and you"
tuso "they and y'all"

That's a lot of roots to use up for such an experimental purpose, not even getting into the fact that some of them already have meanings. But...according to the accentuation rules I've since established, clusters of particles are typically accented on the final member whereas predicates have penultimate stress: thus soní "y'all and I" could exist unambiguously alongside soni "vein."

We could even combine the extended 1st- and 2nd-person forms into some wildly specific clusters:

1st Person Universal
tasení "he/she, you and I"
tasenú "he/she, you and we"
tasoní "he/she, y'all and I"
tasonú "he/she, y'all and we"
tusení "they, you and I"
tusenú "they, you and we"
tusoní "they, y'all and I"
tusonú "they, y'all and we"

How, though, would any of these forms actually be used? How do they fit into syntax? Let's see...

1) I think it's clear that they would need to fit into the category of pronominals we recently discovered that we have: that is to say, they would not take specifiers.

2) It should be unproblematic for these to show up in topicalized, focalized, and oblique positions. In the case of topicalization, the verb would still carry a pronominal prefix if the formal role of the pronoun in question is subject. These forms cannot, however, appear as pragmatically unmarked subjects:

ta.ní, nu-hui he lúlu.pai
she.and.I 1PL-meet TEMP Thursday
"she and I -- we met on Thursday" (topicalized)

se.ní sa luta ka lina kuo
you.and.I FOC find DEF city lost
"it was us (you and I) who found the lost city" (focalized)

ni.papa vo kau tea la se.ní
my.dad PRES.CL send letter DAT you.and.I
"my dad sent us (you and I) a letter" (oblique)

*ta.ní hui he lúlu.pai
she.and.I meet TEMP Thursday
"we (she and I) met on Thursday (unmarked)

3) What about object position, though? Do we say

ta-si.nae se.ní
3SG-saw you.and.I
"he saw you-and-me"

or


ta-si.náe-nu se.ní

3SG-saw-1PL you.and.I
lit. "he saw us you-and-me"

I believe it would be the former, because Koa doesn't have obligatory pronominal clitics for definite objects in other positions (like Macedonian would do in e.g. ја купив книгата "I bought (it) the book"). I think this spread of usages would thus mirror those of the emphatic pronouns nini, sese, etc., though I haven't spelled them out before. In fact, one could potentially regard these as a more specific, more marked category of emphatic pronoun!

Let's throw the "all" forms of the plural pronouns in there as well:

ponú
 "all of us"
posó "all of you"
potú "all of them"

I don't see any particular drawbacks to letting these all exist while we explore what they might feel like in actual usage. How neat!

Saturday, June 7, 2025

Social Niceties III: Discourse Markers

This is inevitably going to be a very incomplete post. Though I'm excited finally to be collecting and documenting the words and phrases that help manage the flow and structure of discourse, this has got to be one of the most fluid and least prescribed areas of any language; I'm sure I've left out a great amount of what would be considered basic social fluency in a competent speaker of Koa. This is one of the places where Koa development is seriously impaired by its extremely small community of speakers (i.e. only me), and therefore the fact that discoveries via conversation are kind of...imaginary.

Regardless, there's a lot here, and I've done my best to sort everything out into the broad Maschler categories described by the Wikipedia article. Many of these expressions could use a whole post unto themselves to fully explore their use, so this should be regarded mainly as an inventory. Note: some expressions appear in more than one category, where senses overlap.

7. Interpersonal Discourse Markers

First, the so-called affective particles: sounds that communicate the speaker or hearer's emotion in the context of the discourse.

aa understanding (surprise)
ee uncertainty, reluctance, hedging ("ummmm, wellll...")
ei calling attention ("hey! oh! hang on!")
eu disgust ("ew, ick")
ii pain, dislike or nervousness ("ow, yikes, ugh")
oi request for repetition or confirmation ("Huh? What's that?; Isn't that right?")
oo understanding (synthesis)
ui regret, commiseration ("oy, oh man, aww")
uu excitement, pleasure ("oooh, eeeee, yay")

Requests for repetition, confirmation or understanding checks:

tasi(a)?; nóia? tási nóia? "What was that? Come again? Beg pardon?"
ai? tag question
ainá?
 tag question in positive sentences ("isn't it?" n'est-ce pas?)
aiá "Oh yes? Indeed?" Also: tag question, negative sentences ("is it?")
ai eso/tota? "is that so? really?"
ai mao? "are you sure?"
tule/tei/pea "go on!"

io is one of the most common discourse markers, and extremely difficult to translate. An extension of its translative meaning -- marking that pragmatically relevant change has occurred -- io shows a shift in topic, "now then..."; or agreement, acceptance, comprehension, or mustering of thought or courage, "m'hm, okay, aha, I see, sure, got it, so..."; or readiness or completion "there we go, that's it." It may appear in various lengths, from short, chopped off by a final glottal stop; or drawn out to ioo. Very few conversations of any length will take place without a liberal sprinkling of ios.

In this meaning, io appears in a number of frequent constructions. It conveys greater immediacy, or finality, or emphasizes the transition to this new state from whatever preceded it, compared to the forms without it:

io ika "okay, that's fine"
io koa "okay, great"
io iha "fantastic, awesome, wonderful"
io cuti "lovely"
io kica "clearly; got it"
io sao "that's right; for sure; no kidding"
io pakoma "understood"

The opposite of io, ca "still" indicates pragmatically relevant lack of change, and conveys reassurance, support, conciliation, deescalation, or bashfulness:

ca, caa "aww, mmmmmm; there there; now now; shucks"

Following another expression, io adds sharpness: tule io "come on!" kulu io "listen up!" ika io! "FINE!" By contrast, ca adds softness: tule ca "come along now," ika ca "that's just fine; it's okay, hon," tei ca "go right ahead."

A frequent way of showing engagement with the discourse uses the information status markers ku "old/expected" and ho "new/unexpected" to announce the listener's relationship to the status of the information being conveyed. They both function like English "m'hm," showing up in similar circumstances.

ku, kuu, kukuku "yeah, totally, naturally, of course"
ho, hoo, hohoho "oh my, yes?? oooh! you don't say. no!"

The evidentials and viridicals provide additional means for listeners to react to information, this time showing their relationship to the origin or reliability of the information:

pu "so they say, apparently, that's what I heard"
li "must be, stands to reason, you'd figure"
vu "I guess, suppose so, if you say so"

A large number of expressions convey more complex emotional orientation to the discourse, from positive to negative; here is a sampling of the most common:

eso "really! right! yeah! exactly!"
voho "wow!"
lele "oh my, oh my goodness"
nóia "oh my, my goodness" lit. "please"
oo válani "my God"
hoia X "what a X"
leki "not quite; as if!"
iti! "unlikely! as if!"
pono "that's right! as it should be! damn straight!"
su/lue/luvu/lahe "no way/come on now/get out" (shock or mock disbelief)
levi/kupo "uncool! not okay!"
kéamu "WTF"
alo "can't be helped, it is what it is"
memi "oh well" lit. "sigh"
ave "too bad, it's a shame"

As with many other expressions, these often appear with io or ca as described above: ave ca "aww, well, that's the way it goes"; nóia caa "well well, good heavens"; io pono "now that's what I'm talking about!"; luvu io "seriously, stop."

8. Referential Discourse Markers

These expressions connect the discourse in terms of sequence, causality and the like:

laa "so" (therefore, consequently, for that reason)
nii "so" (then, in that case, that being so)
sii "then, next" (subsequently)
loko "because..."
eko "also"
eta "and, but, meanwhile, on the other hand" (balancing, drawing comparison: Slavic a, Latin autem, Greek μεν...δε)
iati "that aside, besides"
ala "but, however"
hotai... "actually"
sili... "at least"
hio/male "on the contrary, just the opposite"
meno "anyway, regardless"
caene "what's more"
cahaa vela "yet again"
e tei motoa, etm. = etomó "and so on, et cetera"
molala "for example"

9. Structural Discourse Markers

Introducing information to the discourse, and indicating its perceived importance to the speaker:

io "so...; now:" (introducing something)
veama "first of all, to start with"
velopu "finally, lastly"
io poa "that's it, that's all"
kulu "look, see here; here, check this out" lit. "hear/listen"
huo "see here; check this out" (emphatic) lit. "notice"
soko "here, consider this, how about this" lit. "take this"

10. Cognitive Discourse Markers

Revealing the speaker's thought process:

ee uncertainty; filler ("um, uh, er")
nou "well, so, hmm..."
ilo/mue "y'know" lit. "know/remember"
moko/sema "like...I mean..."

In the fourth and final installment we'll finish up with a more dedicated discussion of respectful (and disrespectful) address, including a very first treatment of obscenity.

Monday, June 2, 2025

Social Niceties II: Permission & Apologies

This is the second in a series concerning the language of social conventions in Koa, beginning here.

4. Permission

The most basic and common word for "please" is nóia, from the root noi "ask for." In this and many other common expressions, note that no pronoun is present, though 1st singular is assumed. One could also say ninóia, but in that case the sense would be less the conventionalized "please," and more literally "I politely request that." Two other ways of asking for permission:

ai saa? "may I?" (response: saa)
ai ika? "is it okay?"

When speaking of entering another person's space, we have some specific expressions:

ai loa? "may I come in?" lit. "am I welcome?"
loa "yes, welcome, come in"
kono "I invite you, I accept"

The last, kono is rather formal, and could also appear as kónoa: something like "do please come in." Loa can also be used instead of hei as an all-purpose greeting when spoken by a host or occupier of an area to which others are arriving.

Incidentally, after receiving something or being granted permission, useful phrases are

kito(a) "thank you" = lit. "grateful" (pe X "for X")
auli(a) "you're welcome" = "willing, eager"

Moving forward, polite forms will be indicated with (a) or (ha) in parentheses as above; note that with this additional final syllable, such words will require a written accent to keep the stress in the right place: kito but kítoa.

5. Responses to Questions & Requests

Koa has an array of nuanced ways to say "yes" and "no," which are detailed at some length in this post. Here are some other ways to qualify a response beyond ia "yes" and na "no."

ia sai / na hetu "yes absolutely / not at all"
kia / kia na "definitely / definitely not"
tetai / tetai na "maybe / maybe not"
kae / kae na "probably / probably not"
(ni)nailo "I don't know"
(ni)nakoma "I don't understand"

6. Pardon and forgiveness

For situations of slight inconvenience or light contravention of social expectations, the most neutral way of saying "pardon me" is aisa (originally derived, as may be evident, from ai saa? "may I?"). The usual responses would be saa "feel free, go ahead" lit. "receive" or uha/pilo "it's nothing" (see below). Other more specific phrases for seeking pardon:

ana(ha) "pardon me" lit. "give," with the intention to take an action (or an object) intersecting with another person
kulu(a) "pardon me" lit. "listen," with the intention to speak to or interrupt someone

Koa also has some common affective interjections for these situations. Ie "just," usually lengthened to iee or even ieeee, can be used as a less formal alternative to aisa or any of the other lexical options to excuse oneself. Ui "oops" and ii "yikes" are sometimes appropriate, or can be combined with ie for added shadings: uiee "oh no, so sorry (with regret)"; iiee "oh dear, I'm sorry (embarrassment, nervousness)."

When the offense is more serious, the expression is tua(ha) "(I) apologize." Some possible responses are:

(vi)uha "it's okay," lit. "be free of responsibility"
(vi)pilo "no worries, it's nothing" = lit. "hold it as unimportant"
naviholi "don't worry"
cati "I forgive you"

Cati is considerably more formal or serious than the other expressions. Note that the optional vi- in some of these is an imperative marker.

On the other hand, if the desire is to communicate sympathy or commiseration rather than personal contrition, the expression for "I'm sorry" is paho or nipaho, literally "(I) regret that."

(Important note: One should carefully contrast the previous with the imperative (vi)paho "I told you so" lit. "regret!", also used to express Schadenfreude in, for example, Poker: "read 'em and weep." Tone of voice and intonation would clearly be rather different between the sympathetic and shaming/bragging readings of paho!)

It's looking like there will be two more parts to this series, and I'll hopefully be back tomorrow with what to me is the most exciting: words/phrases/noises that indicate engagement in and attitude toward a discourse, and help a conversation to flow.

Social Niceties I: Greetings

Happy spring! A couple days ago I started doing a little work definining words and phrases around apology, and somehow it mushroomed into a much larger-scale project concerning the language and formulas of social interaction in general. There was a whole lot that already existed but had never really been documented, another whole lot for which the structures or vocabulary were ripe for the picking but had not yet been established, and just a bit of creation of long-needed predicates.

Rather than laying this out as one enormous and ungainly post, let's do this as a series, kicking off with greetings. Note: Throughout these articles, translation into corresponding English equivalents is bound to be subjective and imprecise. I'll do my best, keeping in mind that my dialect is middle-class white Pacific Northwest American, spoken by an Xennial.

1. Greetings

The simplest way of greeting upon meeting someone is with forms of the predicate hei, literally "greet," listed in order from most to least formal:

héia "hello sir/ma'am"
hei
 "hello"
hei hei "hi"
héipa "hi there"

Héia adds the politeness particle -a (-ha after a final -a), which can be postposed to many of the expressions in this series; the effect is to add intentional, conscious formality or politeness to the utterance. English doesn't really grammaticalize this concept: héia still means "hello" or "hi," but in the context of talking to someone to whom one wants (or is expected) to show respect -- one's boss, for instance.

There are also expressions based on time of day. These use either koa "good" or tuni "peaceful":

amu koa, amu tuni "good morning"
pai koa, pai tuni "good day"
nahe koa, nahe tuni "good afternoon"
lila koa, lila tuni "good evening"

As usual, these could also include the polite -a if desired: ámua koa "good morning, respected person."

I feel like usage here is going to be so biased by my core languages that I don't really trust my own judgment. Based probably on English and Polish, I would instinctively suggest the expressions with amu up until noon, then nahe until 5pm or 6pm, then lila until bedtime...or pai while it's light, lila while it's dark.

Another possibility that breaks out of European conventions would be to designate the koa forms for greeting and the tuni forms for leavetaking, and just use the word for the literal time of day:

pai koa "good day (6am - 6pm, or whenever it's light)
ivo koa "night greetings (6pm - 6am, or whenever it's dark)

...or...

amu koa "good morning (6am - noon)
nahe koa "good afternoon (noon - 6pm)
lila koa "good evening (6pm - midnight)
vave koa "good wee hours" (midnight - 6am)

I kind of like that, though I've very much gotten used to saying ivo koa to my daughter as a leavetaking expression before bed. But why shouldn't there be a way to greet someone when it is, objectively, night? My intuitive cultural logic -- that "night" in this context refers only to time actually sleeping, in which case 3am still counts as "evening" -- is bellowing at me right now, so let's let this sit. I do really like the idea that e.g. amu koa and amu tuni could both correspond to English "good morning," but with a distinction between coming and going that most European languages don't care to make.

Not exactly a greeting, but also worthy of mention here is the affective ei! "hey!", used to attract attention.

2. Phrases of Meeting

Just a few useful social formulae:

kea sa se? "Who are you?"
kele sa senoa? "What's your name?" (Polite: kea sa se nóaha)
X sa ninoa "my name is X"

via/neli/cuti (poli) ko húise "(most) pleased/delighted/lovely to meet you"

kea se semai? "How are you?" = lit. "What is your subjective experience like?"
nimai X "I feel X"
eta se? or eta sese? "and you?"

ke tula sa? "What time is it?" = lit. "What hour is it?"
keka sa ka tula? "What is the time?"
(tula) lima (sa) "(it's) five (o'clock)

3. Farewells

As before, the simplest way to take leave is with the predicate moi "say goodbye," like Spanish despedirse or Polish żegnać:

móia "goodbye, sir/ma'am"
moi
"goodbye"
moi moi "bye"
móipa "bye now"

Some other ways of taking leave:

mai koa "be well, take care"
mene tuni "go in peace, peace be with you"
ata ko hinae "see you later, until next time" = lit. "until seeing each other"

As before, with polite -a: máia koa, ménea tuni.

I might have included ivo koa "goodnight" in this section based on past usage, but I'll let the above discussion stand for now. More expressions specific to sleep:

nuku koa "sleep well"
moe meli "sweet dreams" = lit. "dream sweetly"

All for tonight! I'll be back soon with permission and forgiveness.

Friday, March 28, 2025

A bit of science

This is a catch-all post to document properly the STEM-specific vocabulary that has been developing in Koa over the past several years. This hasn't been particularly intentional: like most of the language, the terms in each of these sets just sprang into being as I felt inspired about them for reasons best known to the Muse. (Who is the Muse of conlanging, anyway?

Here note the predicate tae meaning "science, study, domain of knowledge," very productively used to form the names of branches of science and thought:

címitae "language science" = linguistics
nóatae "name science" = onomastics
élatae "life science" = biology
nácatae "arranging science" = "taxonomy"
síhitae "plant science" = botany
kávotae "animal science" = zoology
pílitae "reptile science" = herpetology
lútetae "bone science" = osteology
métitae "thought science" = philosophy (or should we calque this as kávotae "wise science"?)
válatae "god science" = theology
lúkutae "number science" = mathematics
néatae "computer science"

International System of Units

We have some basics here, following the approximate phonetics of standard international usage. In some cases -- consonant clusters in particular -- they've had to be simplified.

For the base units, so far we have meta "meter" for length, and kamu "gram" for mass (I wish there were a clear word for byte!). The decimal multiples of these are formed via prefixes as usual, thus:

1012  tela- "tera-"
109  kika- "giga-"
106  meka- "mega-"
10 kilo- "kilo-"
10 heto- "hecto-"
101  teka- "deca-"

10-1  tesi- "deci-"
10-2  seni- "centi-"
10-3  mili- "milli-"
10-6  miko- "micro-"
10-9  nano- "nano-"
10-12  piko- "pico-"

The prefixes are stressed, yielding e.g. kílokamu "kilogram," sénimeta "centimeter," and so on.

Biology

Taxonomic levels:

élaine "life domain" = kingdom
osa "branch" = phylum
luo "class"
himo "tribe" = order
pele "family"
suku "kindred, stock" = genus
lei "type" = species
nálalei "subtype" = subspecies

I'm pretty happy with these except for luo which is a calque that I'm not sure is motivated by anything in particular. But on the other hand taxonomy itself is arbitrary almost by definition, so I think I feel okay with just letting that one go and turning my attention back to spending another decade wringing my hands over how to mark indefinite NPs.

At this moment I would really like to offer a devastatingly witty little mnemonic to remember these in order, like the English one about King Phillip...but alas, no such wit has yet been forthcoming. I wish E O L H P S L had an "i" in it somewhere so I could form a proper clause! How about: Énapi One Lomo Ha PaSiko Limu "if one sneaky turtle were wrapped in seaweed"..........?

Linguistics

méama
 "thing-er" = nominal
nóama "namer" = proper noun
étema "doer" = verbal
ílama "be-like-er" = adjectival

címihale "language structure" = grammar
vike "clause"
lelo "sentence"
siki "particle"
mohi "predicate"
mícoma "prefix"
hópama "suffix"

étema sia "active verb"
étema aivu "passive verb"
vike keha "conditional clause"
vike kevo "presentative clause"
vike keu "relative/adjectival clause"

méama tocu "simple nominal"
méama aivu "passive nominal"
méama oisi "abstract nominal"
méama lala "deverbal instantiated nominal"

méama laca "indefinite noun"
méama litu "definite noun"

Clearly lots more to come!

Saturday, March 22, 2025

The conditional clause

Koa has had a word meaning "if," ha -- borrowed from Hungarian, incidentally -- since late 2010. Until recently, though, there hadn't been so much as an attempt to feel out syntax with this particle, or the syntax or semantics of conditional clauses in general: I'm not sure how this is possible, but I may have managed never even to have tried to speak or write a conditional clause in this language. With the recent formalization of clause type marking, we can finally answer some extremely long-standing questions! This post attempts to bring conditional structures up to date with the rest of Koa's development.

Before diving into whole clauses, I should mention that ha can also have the scope of a single predicate, in which case it has the sense of "would-be," "theoretical," "supposed," or rather perfectly, Polish niby. This type of usage was in fact the only example I gave in ha's initial unveiling (respelled with modern conventions):

ka ha-lóha-ni
DEF COND-love-1SG
"my would-be lover," "mój niby kochanek"

There's also a ke-compound, keha, meaning "conditional" or "hypothetical": thus vike keha "conditional clause."

Conditional structures in Koa have a protasis set off by ha "if," and an apodosis optionally introduced by translative io (roughly "already"), the presentative vo, or the heavier-handed laa "consequently," any of these the Koa equivalent of "then." The particular sequence of TAM markers in each half of the construction yields some lovely complexity, which has had the additional benefit of helping to heal the wounds of Ancient Greek studies in college...I had never expected to make friends with the words "future more vivid" again!

There are two broad categories of conditional clauses, each with a set of subtypes. The first category describes situations/conditions that are really and demonstrably true, and is characterized by a protasis and apodosis both in the realis mode. The second category describes situations/conditions that are hypothetical, contrafactual, idealized, or otherwise imaginary; these are characterized by the irrealis mode in the apodosis, and usually the protasis as well.


Table 1: Real Situations

Type Example Protasis Apodosis
General nonpast If X is true, then Y is true simple simple; imperative
Habitual nonpast Whenever X happens, Y happens simple (habitual) simple habitual
Deductive nonpast If X is true now, that means Y was true then                         simple anterior (presumptive) 
General perfect If X has happened, then Y happens anterior simple
General past If X was true, then Y was true anterior anterior
Habitual past Whenever X happened, Y happened anterior (habitual) anterior habitual


Table 2: Imaginary Situations

Type Example Protasis Apodosis
Ideal nonpast If X happens, then Y will happen; If X were to happen, then Y would happen simple irrealis; imperative
Ideal perfect If X were to have happened, then Y would happen now anterior irrealis
Ideal past If X were to have happened, then Y would have happened anterior irrealis anterior
Unreal nonpast If X were true now, then Y would be true now irrealis irrealis
Unreal perfect If X had happened, then Y would happen now irrealis anterior irrealis
Unreal past If X had happened, then Y would have happened irrealis anterior irrealis anterior

Notes:
  1. In Habitual clauses, habitual marking in the protasis is optional.
  2. Other TAM information may be added to the protasis and/or apodosis (ma [imperfect], for instance) without affecting the conditional semantics.
  3. The distinction between Ideal and Unreal is somewhat idealized (no pun intended). Irrealis marking is the default for both protasis and apodosis in imaginary situations, especially in speech (other than the "vivid future," see below).
  4. The Ideal Nonpast corresponds both to what Ancient Greek would consider a more vivid future ("If the babysitter is free, then we will go see the concert") and a less vivid future ("If the baby sitter were to be free, we would go to the concert"). The Koa sense of things is that all futurity is hypothetical and thus does not make this distinction; but if the apodosis describes intention or will, the volitive lu rather than irrealis cu can be employed to approach some of that sense ("If the babysitter is free, we mean to go to the concert").

Now for the fun part: examples. There are a lot of possible uses for each of these types, so this will be just an overview to give a general sense.

Examples - Real Situations

hu anu ha ne ka ipu [simple], ka ipu vo na-moha [simple]
EXIST water COND.CL LOC DEF cup DEF cup PRES.CL NEG-empty
"if there is water in the cup, then the cup is not empty"
(General Nonpast)

ha ni-ca-nuku he-tula iva [simple], vénea-ni [imperative]
COND.CL 1SG-cont-sleep TEMP-hour nine wake-HON-1SG
"if I'm still asleep at 9 o'clock, please wake me up"
(General Nonpast)

ha me-vua [simple], ka toto i io-va-mi-hulu [habitual]
COND COM-rain DEF child VB.CL TRANS-HAB-INCH-crazy
"if/whenever it rains, the kids go crazy"
(Habitual Nonpast)

ka ovi ha hemo [simple], laa se-li-si-tule la.koto me ka áva.le [anterior presumptive]
DEF door COND.CL unlocked therefore 2SG-PRSM-ANT-come home COM DEF key
"if the door is unlocked, then you must have brought the keys home"
(Deductive Nonpast)

le_Kéoni ha si-náe-nu mo-like [anterior], vo nu-me-háka.te [simple]
John COND.CL ANT-see-1PL SIM-together PRES.CL 1PL-COM-trouble
"if John saw/has seen us together, we're in trouble"
(General Perfect)

ha ni-si-kánu-se [anterior]ve na-ilo sa ni-si-ete [anterior]
COND.CL 1SG-ANT-harm-2SG ADV.CL NEG-know FOC 1SG-ANT-do
"if I hurt you, I did so unwittingly"
(General Past)

ha si-va-me-sua ne-lani kica [anterior habitual], ni-simo i si-va-mi-tumu pe-pa.mana [anterior habitual]
COND ANT-HAB-COM-sun LOC-sky clear 1SG-heart VB.CL ANT-HAB-INCH-full BEN-intention
"if/whenever the sun was shining in a clear sky, my heart would fill up with plans"
(Habitual Past)

Examples - Imaginary Situations

le_Lóliki ha kii po kunu [simple], ta-io-cu-opi aha pe-imi [irrealis]
Olga COND.CL get UNIV dog 3SG-TRANS-IRR-learn something BEN-self
"If Olga got a dog (and she might), she would learn some things about herself"
(Ideal Nonpast - her idea!)

ha tu-si-mene i tesu [anterior], vo tu-cu-sano po pa.opo mu.kiki he.lila.tana [irrealis]
COND.CL 3PL.ANT.GO VB.CL travel PRES.CL 3PL-IRR-say UNIV story funny tonight
"If they were to have gone on a trip (and they might have), they would tell some funny stories tonight"
(Ideal Perfect)

ha nu-si-ne le_Pékeli he.sama [anterior], nu-cu-si-te-hi-nae ve.na-huo! [irrealis anterior]
COND.CL 1PL-ANT-LOC Berkeley at.the.same.time 1PL-IRR-ANT-ABIL-REFL-see without.noticing
"If we were to have been in Berkeley at the same time (and we might have been), we could have seen each other without realizing it!"
(Ideal Past)

ka piha i cu-via taa [irrealis] ha cu-me-vua [irrealis]
DEF yard VB.CL IRR-content surpass CON.CL IRR-COM-rain
"The yard would be happier if it were raining (but it's not)"
(Unreal Nonpast - protasis and apodosis reversed)

ka ámo.e ha na-cu-si-láhe-ni [irrealis anterior], ni-na-cu-éki-ne ve.ona ne-sáki.lo he.ti.tia [irrealis]
DEF wife COND.CL NEG-IRR-ANT-leave-1SG 1SG-NEG-IRR-sit-LOC drunk LOC-bar right.now
"If my wife hadn't left me (but she did), I wouldn't be sitting here drunk in a bar right now (but I am)"
(Unreal Perfect)

ha cu-si-me-vitu [irrealis anterior], ka nui i io-cu-si-mu.kino poli_i_taa [irrealis anterior]
COND.CL IRR-ANT-COM-dragon DEF world VB.CL TRANS-IRR-ANT-interesting much.more
"if dragons had existed (but they didn't), the world would have been much more interesting (but it wasn't)"
(Unreal Past)

One comment: as table note 4 above mentions, the Ideal clause types -- though valid -- are less likely to be used in flowing speech: I think they would be more formal or literary if used. The corresponding Unreal types are more natural Koa for both meanings; this causes a small amount of ambiguity in the interpretation of such clauses, but this would be resolved in context:

le_Lóliki ha cu-kii po kunu [irrealis], ta-io-cu-opi aha pe-imi [irrealis]
Olga COND.CL get UNIV dog 3SG-TRANS-IRR-learn something BEN-self
"If Olga got a dog (and she might), she would learn some things about herself" OR
"If Olga were getting a dog (but she's not), she would learn some things about herself"

Whew. It took me a day to write up the grammatical description and several weeks to write the examples...it's clear which is the preferred activity! But I'm happy with the range of meaning these sentences were able to showcase.

De Cuup, where are you?

Dear Robert,

I've been intrigued by your comments on my Koa posts over the years, and even more intrigued by the fact that you were apparently actually reading, and seemingly following and remembering, the content of my exposition at all. Who are you? None of my conlang friends seem to recognize you. Let's have a more substantial bidirectional conversation! My e-mail address is on the About Me page.

-Iúliki

Pronominal predicate correction

After a (decade-)long period of uncertainty, I announced back in 2021 that pronominal predicates -- that is, the form that personal pronouns take for emphasis and when they need to be able to perform all the syntactic roles of full predicates -- would have the following forms:

ni -> nika "I"
se -> seka "you"
ta -> taka "he/she/it"
nu -> nuka "we"
so -> soka "you guys"
tu -> tuka "they"

I've been doing some imaginary conversation practice this past week, and unfortunately I'm now pretty sure that was the wrong decision, for two reasons.

Prior to that 2021 post, we'd always assumed that the emphatic pronouns would simply be doubled versions of the simple ones, e.g. nini, sese, tata, etc. The problem here was that that meant that tata could not reasonably also mean "dad," which left papa in that role...about which I seethed with resentment and hatred. I have no idea why I've felt so strongly about the aesthetics on this one tiny issue, other than that tata is also "dad" in Polish.

I feel like, now that I'm in my 40's, I might finally be old and wise enough to rise above that pettiness. There isn't anything objectively wrong with papa for "dad," and in fact studying Swahili has helped: baba is really a rather nice "dad" word, and is nearly the same as papa.

ANYWAY, the primary issue with nika, seka, taka, &c. as we've had them the last few years is that they feel wrong in speech. Where I've tried to use them they just haven't worked, subjectively, and I've been surprised to find myself using the reduplicated versions instead. I wish I had any actual examples to discuss at this point; unfortunately these practice conversations seem to be the linguistic equivalent of those spontaneous music improvisation sessions with emergent unrecorded marvels that I can never reproduce.

One reason for that sense of wrongness -- maybe -- is that although nika and friends do look like pronominals (toka "that one," poka "everyone," nahuka "none of them"), the derivational process that would lead to them would actually give them a different meaning! To wit...

ti ulu "this fingernail" -> tika "this one"
ni ulu "my fingernail" -> nika "my one, mine" (!!)

Thus it would appear that, if anything, the personal pronominals with -ka formatives ought to be possessive pronouns!

...except that I don't really love possessive pronouns for Koa: they don't quite seem to match the soul of the language. Instead we have constructions with keme "attribute" and oma "one's own, belong to":

ka ulu kémeni "MY fingernail" -> ka kémeni "my one, mine"
ka ulu ómani "my very own fingernail" -> ka ómani "the one that belongs to me"

I will note that I came to a very similar (and well-researched) set of conclusions to literally all of the above about 7 months earlier in 2021 when I first tried to hash out the form of these pronominals in detail. This seems to happen to me; it can be hard to trust past Julie over the fire of the current moment, and sometimes it takes several cycles before things finally stick.

On the other hand, Koa has been starting to allow a much greater range of complexity recently as a sense of register has developed, and it occurs to me that nika et al. could continue to exist but just feel very formal or old-fashioned, rarely used in speech. Like maybe it could show up in legal contracts? On a meta-level this doesn't sound like I'm talking about an IAL at all, but on the other hand who's to say that the typical, purely philosophical historical attempts at IAL design have been right to eschew this kind of pragmatic range? What good is human language without deep poetry?

Given that it's a children's book, then, all this does have some implications for Are You My Mother? I think there are three tokens in that text of emphatic pronouns, which now need to change, e.g.:

Ni-ilo ka imi SESE, ka ame-nene i sano.
1SG-know DEF self 2SG.EMPH DEF bird-baby VB.CL say
I know the equals-YOU one, the baby bird said
"I know who you are, said the baby bird."

Ame sa se, e mama óma-ni sa!
bird FOC 2SG and mama belong.1SG FOC
A bird is what you are, and it's my own mama!
"You are a bird, and you are my mother!"

The degree to which these feel wildly better than what I first posted is some additional evidence that these revised decisions are sound!

I knew this kind of thing would happen as soon as I published that piece. I considered just making edits to the original post as they occurred to me, but it feels dreadful to erase past process like that. Maybe when I'm reasonably sure (ha ha) that there are unlikely to be future changes, I can release a PDF or something.

Very lastly, going back to tata vs papa for "dad," I might also note that this issue has spurred the creation of a fleet of other heretofore nonexistent kinship terms as well. This is still solidifying, though, and doesn't really bear on the topic at hand, so I'll save it for another post...

Sunday, March 9, 2025

Headless relatives and nominal subjects

This is a brief excursus to note an interesting consequence, and some interesting questions, arising from the way we've just decided to handle nominal subjects in relative clauses, in the context of the way we handle headless relatives specifically. This occurred to me with surprise just now while writing about something else and I didn't want to forget it!

To set this up, let's take a second to remember how predicates used adjectivally work in syntax. At base, the predicate simply follows the head it describes, without specifier:

ka sivu vihe
DEF leaf green
"the green leaves"

These adjectivals can also stand on their own with a specifier to describe an unnamed entity with the given characteristic, thus

ka vihe
DEF green
"the green one(s)"

It does occur to me to wonder in this moment if there's a "Ø" in the nominal slot in that kind of construction -- not the kind of thinking we've typically applied to the syntactic motivation of lexical class in Koa, but it's an interesting question:

?ka Ø vihe
DEF Ø green
"the green Ø"

Anyway, we can also give that adjectival a verbal force by turning it into a relative clause with u, thus

ka sivu u vihe
DEF leaf REL.CL green
"the leaves that are green"

So far so good, and uncontroversial in modern-day Koa. Now, looking at that last example, suppose we adjust the relative clause to have a nominal subject:

ka sivu [ le Kéoni u ako ]
DEF leaf [ NAME John REL.CL pluck ]
"the leaves John picked"

Here the entire clause le Kéoni u ako "that John picked" is functioning as an adjectival modifying the head ka sivu "the leaves." But what happens if, like ka vihe "the green ones" above, we want to delete the head noun? What if we just want to say "the ones John picked"? It would appear, keeping the structures parallel, that we would have to be left with the following!

ka [ le_Kéoni u ako ]
DEF [ John REL.CL pluck ]
"what John picked, the one(s) John picked"

Integrated into a matrix clause, we'd have:

Ai se-si-nae ka [ le_Kéoni u ako ] ?
QU 2SG-ANT-see [ John REL.CL pluck ]
"Have you seen the one(s) John picked?"

Even though this output follows logically, and I don't see anything wrong with the structure itself, it boggles my mind in a rather uncomfortable way. It's possible that I still need some time to get used to these newly-approved clause type markers and the way they show up in syntax; if that were older hat, perhaps the above wouldn't look surprising at all.

As an excursus to the excursus (an exexcursus?) I might mention that another allowable strategy for the relative clause up there would be to remove the clause marker and thereby make it non-finite; I've been saving the full discussion of these structures for some presumably upcoming post on nominalized clauses specifically. But just for completeness, this would be another way to say the same thing:

ka [ le_Kéoni ako ]
DEF [ John pluck ]
"the one(s) John picked"

Back to the plot, however weird it looks, ka le Kéoni u ako does again seem to be completely above board syntactically according to all the rules we've worked out to this point. Pushing into extremely speculative territory, though, I do have a tiny tingle of curiosity about whether -- in addition -- ka could itself show up as a clause type marker instead of u for this particular kind of construction. That would give e.g.

?le_Kéoni ka ako
John ?.CL pick
"what John picked, the one(s) John picked"

...or, in a larger clause,

?Ai se-si-nae [ le_Kéoni ka ako ] ?
QU 2SG-ANT-see [ John ?.CL pluck ]
"Have you seen the one(s) John picked?"

Honestly, I don't know! It's beautiful in the same way as the other dependent clauses, it just never, ever occurred to me that Koa syntax could possibly work this way. Since there's more than one possible specifier for the item(s) picked, though, would this logic take us into complete absurdity?

ti vihe
this green
"this green one"

po vihe
UNIV green
"green ones (in general)"

po_ka vihe
all green
"all the green ones"

Now with the standard headless relativization strategy:

ti [ le_Kéoni u ako ]
this [ John REL.CL pluck ]
"this one John picked"

po [ le_Kéoni u ako ]
UNIV [ John REL.CL pluck ]
"ones John picked (in general)"

po_ka [ le_Kéoni u ako ]
all [ John REL.CL pluck ]
"all the ones John picked, everything John picked"

So far so good...but

?le Kéoni ti ako
"this one John picked"

?le Kéoni po ako
"ones John picked (in general)"

?le Kéoni po ka ako
?po le Kéoni ka ako

"everything John picked"

Uhh...my parser definitely just broke. My instinct is the clauses with ka kind of make sense -- though I would need to do some research to decide what even to call such a clause...just "headless?" -- but that it's madness to allow any and every possible specifier to sub in for a clause type marker. Given, though, that it's a basic Koa principle that where one particle/predicate/structure of a given type can go, any other such particle/predicate/structure can go, we would have to be careful with definitions here. We'd have to say...that ka as a clause type marker is homophonous with ka "the" but is in fact a different marker, in the same way that ko forms abstract nouns but also has a separate identity marking finite clauses used as nominals.

Going to have to sit with that.

In the mean time, let's try a more complex sentence and see whether that creates a train wreck with any of these strategies: "I gave you what John said Mary wanted." That has two levels of embedded clauses, one of which is a headless relative and both of which have nominal subjects...

1. ni.ána.se ka [ le_Kéoni u sano [ le_Meli ko halu ] ]
I.gave.you DEF [ John REL.CL say [ Mary NOM.CL want ] ]

2. ?ni.ána.se [ le_Kéoni ka sano [ le_Meli ko halu ] ]
I.gave.you [ John HDLS.CL say [ Mary NOM.CL want ] ]

3. ni.ána.se ka [ le_Kéoni sano ko [ le_Meli halu ] ]
I.gave.you DEF [ John say COMP [ Mary want ] ]

Example 1 uses the standard form with finite dependent clauses; example 2 incorporates the experimental new headless relative clause marker; and example 3, for contrast, shows the alternative strategy in which both subordinate clauses are nominalized (nonfinite).

Between (1) and (3) I feel like I'm getting a difference in register. With finite dependent clauses there's no question that the syntax is much more complex, in perhaps a slightly formal/scientific/legal/nerdy way? The closest similar distinction I can draw off the cuff in English would be between "the things that Mary said that she wanted" and "what Mary said she wanted," but I think in Koa the difference in register is wider. Interesting.

With (2)...I think this structure is still too experimental for me to have intuitive feelings about it. I can get at le Kéoni ka sano as "what John said," but once that clause starts to take its own complement I can't process it at all. This may be a place where we need either more examples in actual use, more crosslinguistic evidence, or both.

What about other kinds of headless relatives, though? What about where John said it? Oh dear. Traditionally I think we would have this, with a ke-compound; unfortunately the article on theta clauses had not one single example clause with a nominal subject, so my choice of ko as the clause type marker here is intuitive rather than examined!

kene le_Kéoni ko sáno-ta
location John NOM.CL say-3SG
"(the place) where John said it

or, non-finite,

kene ko le_Kéoni sáno-ta
location COMP John say-3SG
"(the place) where John said it

Actually I think we're okay. By analogy to the other headless relative types we've just been discussing I was wondering if we might also have e.g. ne ka le Kéoni u sano, but that's something different: it would mean "in the things John said."

To be fully transparent, I have to say I'm wincing a bit at sentences like niánase ka le Kéoni u sano le Meli ko halu; the syntax is complex at a level that younger Koa would have rejected reflexively as clearly, obviously unacceptable to the charter and to the spirit of the language. I feel a bit wistful, or regretful, for our now lost fully modular structures, which would have made this sentence come out simpler and much more recognizable to Koa's original creole muses:

ni.ána.se ka [ le_Kéoni i sano ko [ le_Meli i halu ] ]
I.gave.you DEF [ John FIN say COMP [ Mary FIN want ] ]
"I gave you what John said Mary wanted"

I don't know what to do with how much I like that...and how much more I like it than most of what we've been talking about here today. I wonder how much choice between structural options is reasonable in a language: could all of these options be acceptable depending on style and register? Swahili certainly has multiple allowable strategies for relativization, for example, each kind of singing its own song. This feels like it's verging onto artlang territory just a bit, but maybe that's inescapable as soon as any conlang is subjected to the expressive needs of actual use, whether or not its speakers admit it.

Well, that was much less brief than I expected or intended, and way to make myself question everything yet again. I guess it turns out this was a corner of Koa syntax that still needed some rigorous investigation! ...and as to that final point, some additional soul-searching.

Saturday, February 22, 2025

Ai Se Sa Nimama? Interlinear - Part 2

This is the conclusion of the interlinear gloss and commentary for my Koa translation of Are You My Mother. Part 1 is here, and the full text is here.

Ka ame-nene i luvu ve cu-meti.
DEF bird-baby VB.CL cease ADV.CL IRR-think
The baby bird ceased (in order) that he might think
"The baby bird stopped to think."

Directly inspired by Nahuatl, here we have an irrealis adverbial clause ve cumeti "in order that he might think." This is one of two primary ways of showing purpose (more about this in an upcoming post on adverbial clauses); the other would be the nonfinite la ko meti, literally "for thinking."

Ka séne-to e ka móa-he i na-ta-mama.
DEF cat-CHILD and DEF chicken-FEM VB.CL NEG-3SG-mama
The kitten and the hen are not his mama
"The kitten and the hen were not his mother."

In re the little argument I had with myself in Part 1, here's yet more internal evidence that inalienably possessed nouns can be predicates on their own...

Ka kunu e ka léma-e i na-ta-mama.
DEF dog and DEF cow-FEM VB.CL NEG-3SG-mama
The dog and the cow are not his mama
"The dog and the cow were not his mother."

Ni-ku-me-mama, — ka ame-nene i sano,
1SG-OLD-COM-mama DEF bird-baby VB.CL say
I clearly am with mother, the baby bird said
"I have a mother, said the baby bird."

I originally translated this as ni-ia-me-mama with the ia "yes/certainty" viridical showing verum; in this interpretation, the baby bird was pushing back against a universe he felt was suggesting he might not have a mother. I don't think that's positively wrong, but as I thought about it, in this moment I don't think he's focusing on the truth of his statement so much as establishing a base of known information. Ku is much better, a particle which is almost impossible to translate succinctly and fully into English (Polish has a perfect translation, przecież: Matkę przecież mam... "I [obviously] have a mother..."). It indicates that the information being conveyed is old, and is already (or should already be) known. English can kind of get at it with "clearly," "obviously," "of course," or "you know."

Ni-ilo, ia-ki-tai.
1SG-know VERUM-DEB-be
I know, it must be
"I know I do."

...on the other hand, here he definitely is asserting truth (just like I am in this moment), so ia is the most appropriate translation.

Ni-lu-lúta-ta. Ni-cu-lúta-ta. IA-CÚ!
1SG-VOL-find-3SG 1SG-IRR-find-3SG VERUM-IRR
I want/intend to find her. I will/would find her. Will/would!
"I will find her. I will. I WILL!"

I'm attempting here to convey the baby bird's escalating commitment to the task at hand, though using somewhat different words than the English. I continue to be a little uncertain about full extension of contextual meaning available to each of these particles lu "volitive" and cu "irrealis"; I'm hoping that lu can convey intention in the first clause, and then that cu could be a prediction of the future in the others. Now I wonder if all three clauses are really more about determination than fortune telling, though...should I have done lu for all three? Nilulútata. Ialú. IALÚ!

I guess there are two issues here: the accuracy of the translation of the English meaning on the one hand, and the naturalness of the Koa on the other. When I read just the Koa, this feels like a reasonable thing for the bird to say in this moment, in line with the vibe of the English if not exactly the same. For the moment I'll leave things as they are with the lu version as an understudy...and I'd really better think more about distinguishing intentionality versus desire with lu, and futurity versus imagination with cu.

Ie-he-toa ka ame-nene vo nae a mea iso.
JUST-TEMP-that DEF bird-baby PRES.CL see INDEF thing big
Just at that time, behold, the baby bird sees a big thing
"Just then the baby bird saw a big thing."

Should I have thought more about this calque of the English "just then"? It might very well be idiomatic Koa, but it's also not a phrasing I've ever actually used before. On the other hand, the presentative vo is doing great work in this sentence.

SEKA sa ni-mama! — ta-sano.
2SG.EMPH FOC 1SG-mama 3SG say
You are the one who is my mama! he said
"You are my mother! he said."

Noting in passing the first recorded use of the emphatic/predicative version of a personal pronoun. The bird could have said just se sa nimama, but not with the degree of emphasis we see here.

Ka mea iso i sano — HUAUU!
DEF thing big VB.CL say snort
The big thing said SNORT
"The big thing said, 'SNORT!'"

I have no idea how to actually say "snort" in Koa, so I tried to imagine how one might write the sound of a steam shovel in Koa phonology...

Ii, na! — ka ame-nene i sano.
EMOT NEG DEF bird-baby VB.CL say
Yikes, no! the baby bird said
"Oh, no! said the baby bird."

Hidden away in the "Particles" tab of the Koa lexicon and never before mentioned aloud is a list of emotive noises. I was glad I had something ready to translate that "oh, no!" intentionally! The full list is:

aa understanding, surprise
ee uncertainty; filler (um, uh, er)
ei calling attention
eu disgust
ii pain, dislike or nervousness
oi request for repetition or confirmation
oo understanding, confirmation
ui regret, commiseration
uu excitement, pleasure

Na ni-mama sa se. Huauu mu-pato sa se!
NEG 1SG-mama FOC 2SG Snort CAUS-fear FOC 2SG
It's not my mama that you are. Snort cause-fear is what you are
"You are not my mother. You are a scary Snort!"

Ka Huauu i nose ka ame-nene
DEF Snort VB.CL raise DEF bird-baby
The Snort lifted the baby bird
"The Snort lifted the baby bird"

i la-nomu, la-nomu, la-nomu.
VB.CL DAT-upper DAT-upper DAT-upper
to-upper, to-upper, to-upper
"up, up, up."

Sii hua i osu.
next something VB.CL occur
Next something happened
"Then something happened."

Ka Huauu vo ie-pane ka ame-nene
DEF Snort PRES.CL JUST put DEF bird-baby
Behold, the Snort just put the baby bird
"The Snort put the baby bird"

ie "just" is translating the sense of "right back in the tree" in the next line.

i lai la ka puu.
VB.CL return DAT DEF tree
returns to the tree
"right back in the tree"

Ka ame-nene i ho-ne-koto!
DEF bird-baby VB.CL NEW-LOC-home
The baby bird is wow in home!
"The baby bird was home!"

Ho is exactly the opposite of ku, marking information that's new, previously unknown, or surprising. This could also have been a strategy for two clauses above, as in Ka Huauu vo hoiepane ka amenene. This particle's usage is still in the process of being fully understood so I'm not sure whether two ho's in quick succession like that would be too much stylistically; in case it would be, I thought it should be reserved for the conclusion of that series of dramatic occurrences.

Ie-he-toa ka ame-mama vo lai.
JUST-TEMP-that DEF bird-mama PRES.CL return
Just at that time, behold, the mama bird returned
"Just then the mother bird came back."

Ni-ilo ka imi SEKA, ka ame-nene i sano.
1SG-know DEF self 2SG.EMPH DEF bird-baby VB.CL say
I know the equals-YOU one, the baby bird said
"I know who you are, said the baby bird."

Na séne-to, na móa-he, na kunu sa se.
NEG cat-CHILD NEG chicken-FEM NEG dog FOC 2SG
It's not kitten, not hen, not dog that you are
"You are not a kitten or a hen or a dog."

Na léma-e, na Huauu sa.
NEG cow-FEM NEG Snort FOC
It's not cow, not Snort
"You are not a cow or a snort."

Ame sa se, e mama nika sa!
bird FOC 2SG and mama 1SG.EMPH FOC
A bird is what you are, and it's my mama!
"You are a bird, and you are my mother!"

No further commentary here other than that I particularly like the translation of the last five sentences. On the whole, warts and all, I'm pretty happy with this inaugural translation in general; I feel like a Koa-speaking toddler would find it fun, natural and idiomatic. I suppose now I ought to try another genre...maybe a short news story? That would maintain the presentative nature of the information structure, but flip to the far other side of formality of language: maybe an interesting exercise. I'll think about it.

Thursday, February 20, 2025

Finite clause types at last

In the winter of 2012, while on a dog walk in Washington Manor, out of the blue I had an idea about Koa clause marking. This idea felt crazy and revolutionary, but also beautiful, and possibly even ingenious. I dismissed it as too revolutionary and let it go for almost a decade, but never forgot it; and in the past few years of feverish Koa development I haven't been able to stop thinking about it.

The idea concerns the i that precedes the VP when it lacks a pronominal subject. Prior to 2012 I assumed the rule was simply that verbs have a pronoun slot that must be filled, so if an overt pronoun is absent, i is required as a placeholder. This is the case (more or less) in Toki Pona, which originally inspired this marker.

The revolutionary idea was that perhaps I had misanalyzed my own syntax. What if, instead of being a  pronoun, i is marking the clause type? And if that's the case, what if the ko I had previously regarded as a specifier that nominalizes clauses, or a complementizer, is actually just marking another type...and the u I thought was a relativizer was marking yet another? And what if there could be other markers too?

Borrowing from the original post in January 2012 in which I introduced and then immediately discarded this idea, this would give us the following marking for finite clauses:

i - verbal
ko - nominal
u - adjectival

...and the following template for the VP:

(SUBJECT) TYPE (PRONOUN) (TAM) VERB (OBJECT)

There's an important bit of subtlety here that I need to make sure to touch on before continuing. When languages nominalize clauses they usually become non-finite, sort of by definition; but as I said above, clauses with these markers are all finite. What one has to remember is that every Koa predicate can be verbal, nominal, or adjectival depending on its syntactic position; in this system, clauses are no exception. These clause type markers identify what the clause is doing in syntax -- i.e. whether the clause is acting like a verbal, nominal, or adjectival predicate -- but the clauses are still finite regardless.

Clause nominalization, the nonfinite variety, is an entirely different topic...though it does also exist in Koa, and would also be able to used for similar purposes! This is all very easy to confuse, and I think I've been consistently inconsistent or imprecise in my language here in the past: in Koa there's an extremely important distinction between a nominal clause and a nominalized clause. I should possibly ideally find a different term for one of these!

Anyway, those who have been following the plot closely may have noticed that I've been nonchalantly slipping structures like these into example sentences and translations for more than a year. After I discovered last year that Macedonian does "nominal clauses" this way too, e.g.

не сакам [ Јуле да знае ]
NEG want.1SG [ Julie NOM.CL know.3SG ]
"I don't want Julie to know"

...for which the precisely parallel Koa translation would be...

ni-na-halu [ le Iúliki ko ilo ]
1SG-NEG-want [ NAME Julie NOM.CL know ]

...and also discovered that Basque does nominal (and conditional, and adverbial) clauses this way:

[ bere aita Californian dago-ela ] esan_du Mikelek
[ his father California.LOC be.3S-NOM.CL ] say.3SG Michael.ERG
"Michael says his father is in California"

le_Mikele i sano [ taémaka ko ne le_Kalipónia ]
Michael VB.CL say [ his.father NOM.CL LOC California ]

...and that Swahili does relative clauses this way:

kitabu [ a-li-cho-ki-soma mtoto ]
bookβ [ 3SG.SUBJα-PAST-RELβ-3SG.OBJβ-read childα ]
"the book that the child read"

ka tusi [ ka toto u luke ]
DEF book [ DEF child REL.CL read ]

...I was convinced that this isn't a completely outlandish, typologically ridiculous strategy at all. It's certainly not augmenting Koa's IAL-worthiness, but as I keep pointing out, in some cases the art is starting to feel more important than the charter, and these structures feel right for Koa. They're simple and elegant, and Koa syntax as a whole actually makes more internal sense when they're handled this way.

And so, by the power vested in me, I hereby admit these structures into the Koa canon. As ever I have the right to change my mind, but I'm going to give them their fair shot and see how Koa feels in practice when it makes full use of them.

I alluded above to the possibility that there might even be more than the three clause types referenced so far, and indeed many more have surfaced. Here are the categories and subcategories of clause types as identified so far:

i - verbal: neutral unmarked
vo - verbal: presentative
oe - verbal: obligative
ea - verbal: hortative
ko - nominal
u - adjectival
ve - adverbial
ha - conditional

Notes about distribution:

1) Verbs with pronominal subjects have some special properties, which affect the use of these clause markers. i is not used (or is extremely marked, TBD); ko is optional; and u is also optional. The remaining markers are required in all circumstances regardless of clause type.

2) A clause carries one and only one clause marker; BUT

3) Serial verbs to the right of the main verb are marked by i regardless of the clause type of the main verb (see example below).

In any event, this is really just an introduction and opportunity for me to welcome this new(ish) syntax out loud. Many of these types will need a post of their own to demonstrate usage; first up and currently in progress is a discussion of the newly discovered and rather exciting conditional structures, which have been patiently waiting for someone to notice them for 25 years.

Oh, and about those actually legitimately nominalized clauses I mentioned earlier...it turns out I got those right in that same 2012 post as well, actually. They're formed with specifier version of ko, and all clause markers -- as markers of finiteness -- are omitted. One example for now, pending a full treatment in a later post:

le Mia i sano ko [ ka moa Ø ma-lalu i poli ]
NAME Mia VB.CL say NOM [ DEF chicken NONFIN IMPF-sing VB.CL much ]
"Mia said the chickers were singing a lot"