How letters connect

Arabic is a joined script — most letters change shape depending on where they stand in a word.

If you have looked at an Arabic word and wondered why the same letter seems to take several wildly different forms, the explanation is the joining system. Arabic is not written in discrete blocks the way printed English is; it is, fundamentally, a cursive script. Letters connect to one another along the baseline, and most letters take a different shape depending on whether they connect on one side, the other, both, or neither. There are four possible positions, and so up to four shapes per letter:

Remember that "left" and "right" here mean what they mean to the reader: in Arabic, "the next letter" lies to the left, and "the previous letter" lies to the right. The four-form table is one of the things that makes Arabic look intimidating to a beginner but starts to feel orderly within a few weeks.

Most letters connect both ways

Twenty-two of the twenty-eight letters connect on both sides. Their shape changes are usually small — a stub at one end shortens, a tail at the other end appears or vanishes — and the basic body of the letter stays recognisable. Baa is a textbook example. In its isolated form it is a small bowl with a single dot underneath. As an initial it loses the closing flick on the left so it can connect outward. As a medial it is reduced to a tiny tooth on the line. As a final it has the bowl-tail again, attached on the right.

بIsolated
بـInitial
ـبـMedial
ـبFinal

ʿAyn is more dramatic. The four forms look almost like four different letters to a beginner — the eye-shaped isolated form, a small open initial, a pinched closed medial, and a long sweeping tail in the final form — but they are all the same letter, and once you have read enough text the connection is automatic.

عIsolated
عـInitial
ـعـMedial
ـعFinal

Saad is another both-side connector with a clear oval body and a tooth on the trailing side, and demonstrates how an emphatic letter keeps its distinctive bowl shape across all four positions.

صIsolated
صـInitial
ـصـMedial
ـصFinal

The six non-connectors

Six letters break the joining system. They connect to the letter before them — that is, to the right — but never to the letter after. Whatever follows them must start a new segment. They have no medial form (because they cannot be in the middle of a connected segment) and no initial form (because they cannot connect onward). They have only two shapes: isolated, and final.

The six are:

Raa is a useful illustration of what this looks like in a word. Take دَرَس darasa, "he studied." It is one word, three letters: daal-raa-siin. But because both daal and raa are non-connectors, the word visually splits into three separate segments — daal on its own, raa on its own, and siin on its own — even though phonologically and grammatically it is a single, indivisible word. The gaps are not spaces; they are forced visual breaks.

رIsolated
No initial
No medial
ـرFinal

Word-spaces in Arabic are a little wider than these forced internal gaps, but the difference is subtle and it takes a learner some time to read it confidently. The single most useful thing you can do early on is memorise the six non-connectors. From then on, when you see a gap in a word, you can predict which letter is on its right.

The laam-alif ligature

One pair of letters fuses into a single mandatory shape: laam followed by alif. Written separately the two letters would look like ل + ا. In every printed and handwritten Arabic, however, the pair is written as a single ligature, لا, with the laam's vertical bowing into the alif's stroke. The ligature is so universal that fonts treat it as a separate glyph, and not handling it is one of the markers of a poor-quality Arabic font. The same fusion occurs after the definite article, so الـ + alif gives the long form الا seen at the start of many words.

The most-encountered word containing the laam-alif is لا itself, which is the standard word for "no."

What this means for reading

Two practical takeaways. First, do not try to learn the four positional forms as if they were unrelated shapes. Learn the letter, then learn how its body simplifies in the middle of a word, then learn how it grows a tail at the end. The variants are systematic and quickly become invisible. Second, internalise the non-connectors early. The single biggest reason an Arabic word looks like a string of disconnected blots to a learner is that they have not yet noticed which letters force a break. Once you have, the script reads as a single ribbon — broken by spaces between words, and broken in places within words by the six.