Abhiroop Sarkar

Remember why you started

2025-07-09T00:00:00+00:00

I think that it’s extraordinarily important that we in computer science keep fun in computing. When it started out, it was an awful lot of fun. Of course, the paying customers got shafted every now and then, and after a while we began to take their complaints seriously. We began to feel as if we really were responsible for the successful, error-free perfect use of these machines. I don’t think we are. I think we’re responsible for stretching them, setting them off in new directions, and keeping fun in the house. I hope the field of computer science never loses its sense of fun. Above all, I hope we don’t become missionaries. Don’t feel as if you’re Bible salesmen. The world has too many of those already. What you know about computing other people will learn. Don’t feel as if the key to successful computing is only in your hands. What’s in your hands, I think and hope, is intelligence: the ability to see the machine as more than when you were first led up to it, that you can make it more.

—Alan J. Perlis (April 1, 1922 – February 7, 1990)

From the “Dedication” paragraph of the legendary Structure and Interpretation of Computer Programs book, read 8 years ago!

On Equality

2025-03-19T00:00:00+00:00

One of the most fundamental notions in type theory is equality. There are various forms of equality, including propositional equality, judgmental equality, typal equality, extensional equality, intensional equality, path equality (a key concept in homotopy type theory), among others. The list goes on, and any deep exploration of the concept of equality could easily expand to book length. Instead of attempting to cover all these notions at once, I believe it’s best to build this blog post incrementally.

Martin-Löf Equality

The Swedish logician Per Martin-Löf pretty much introduced¹ the notion of dependent typing (eponymously known as Martin-Löf type theory) to the world. Every theorem prover/proof assistant in existence owes its origin to Martin-Löf’s seminal work.

A key concept in his type theory is what we call Martin-Löf equality, a dependent type that takes a fixed term x : A and maps any term y : A to the type x ≡ y. I will be using Agda in this blog post. The equality definition in Agda looks like the following:

data _≡_ {A : Set} (x : A) : A → Set where
  refl : x ≡ x

The {A : Set} is a type parameter, meaning the equality type is polymorphic over any type A in the universe Set. refl : x ≡ x is the sole constructor of the equality type. It states that the only way to construct a proof of equality is by showing that a term is equal to itself (reflexivity).

Leibniz Equality

One of the earliest notions of equality was given by the philosopher and mathematician Gottfried Wilhelm Leibniz. Leibniz in his 1686 treatise², Discourse on Metaphysics, asserted the identity of indiscernibles - two objects are equal if and only if they satisfy the same properties.

The above is known as Leibniz Equality. Leibniz equality is usually formalised to state that x ≐ y holds if every property P that holds of x also holds of y. Or,

_≐_ : ∀ {A : Set} (x y : A) → Set₁
_≐_ {A} x y = ∀ (P : A → Set) → P x → P y

Note the use of Set₁, which represents a hierarchy of types where Set : Set₁, Set₁ : Set₂, etc to bypass Russell’s Paradox and Girard’s Paradox. A recently published work showed that Leibniz Equality is isomorphic to Martin-Löf Equality. The fact that Martin-Löf equality implies Leibniz equality, and vice versa is short enough to be shown here as follows:

≡-implies-≐ : ∀ {A : Set} {x y : A}
  → x ≡ y
    -----
  → x ≐ y
≡-implies-≐ x≡y = λ P Px → subst P x≡y Px

≐-implies-≡ : ∀ {A : Set} {x y : A}
  → x ≐ y
    -----
  → x ≡ y
≐-implies-≡ {A} {x} {y} x≐y = Qy
  where
    Q : A → Set
    Q z = x ≡ z
    Qx : Q x
    Qx = refl
    Qy : Q y
    Qy = x≐y Q Qx

Footnotes

Work by Howard and de Bruijn predates Martin-Löf's work, however, Martin-Löf is credited with formalising dependent types into a coherent framework for constructive mathematics ↩
A work from 1686 is by far the oldest paper that I have referred to in my academic career ↩

20 Films That Inspire, Teach and Move

2025-02-09T00:00:00+00:00

Inspired by Satyajit Ray’s Deep Focus—a compulsory reading at FTII Pune and NSD Delhi—I was struck by the depth and nuance with which Ray analyses his favourite films and directors. This led me to finally compile a list of my own 10 (+10) favourite films. However, I soon realised that rather than ranking them purely on cinematic merit, my choices were more personal—a reflection of the life lessons they offer.

Some of my favourite films, like Pulp Fiction or any Christopher Nolan film, don’t necessarily impart a life lesson—nor is it the director’s duty to teach one. But in the end, I felt compelled to share this list anyway. Below, I’ve written a few lines about what drew me to these works. To the reader, I can assure you that each of these films will be a moving experience if you take the time to watch them—and I hope you enjoy them as much as I did!

1. 12 Angry Men (USA)

Surprisingly, the easiest pick for me was number one. From the moment I began this list, I knew that the 1957 film - 12 Angry Men undeniably belonged atleast in the top three. The film casts a sharp eye on the justice system, exploring its fragility and biases.

A phrase that originated in the legal system and has since become a mainstay in everyday conversation—”beyond reasonable doubt“—is central to the film. 12 Angry Men takes the concept of “reasonable doubt” and turns it on its head, exposing how deeply personal prejudices shape our judgment, often influenced by the social strata we belong to. It challenges us to question our assumptions and remain wary of absolute truths in society.

At its core, the film is also a sobering reflection on how our justice system hangs by a thread—a thread held by the jury. Their decisions, often swayed by personal moods or circumstances, can mean the difference between life and death for a defendant.

Keywords: Reasonable Doubt, Truth, Law, Justice, Ethics

2. Incendies (Canada)

To this day, the climax of Incendies gives me goosebumps. Based on Wajdi Mouawad’s play of the same name, this screen adaptation weaves a non-linear narrative around the Lebanese Civil War of 1975 and its profound impact on Nawal, an Arab immigrant in Canada.

As Nawal’s children uncover her past, piecing together the fragments of her life, we come face to face with the harsh realities of war—the suffering of ordinary people, the state of women in a war-torn society, honor killings, and other deeply powerful themes. The film has often been compared to Greek tragedies, and its climax remains one of my all-time favorites. It is the kind of ending that compels you to sit in silence, reflecting on what you have just witnessed.

Keyword: Anti-war

3. Aamis (India)

Aamis (English: Ravening) is probably the only non-commercial, slightly artsy film on this list. This ingenious hidden gem takes a seemingly simple concept—the food we eat—something innocuous and uncontroversial, and pushes it to an extreme that challenges our fundamental beliefs. To what degree do we pursue hedonism without realizing how far we are pushing the boundaries of what we consider normal?

At the same time, Aamis is one of the few Indian films that shines a light on the often-overlooked region of Northeast India and its food traditions. However, does its imagery reinforce binary stereotypes? I came across this critique in a social-theory paper on Aamis. Regardless, this film is so thought-provoking that it will undoubtedly leave you reflecting on your own belief systems.

Keywords: Food Habits, Belief system, Hedonism, Prejudice

4. Whiplash (USA)

Whiplash explores a complex and fascinating topic—the relentless pursuit of perfection and the lengths one will go to achieve it. Set in a fictionalised version of the legendary Juilliard Music School, New York, the film follows a ruthless teacher (J.K. Simmons) who pushes his students to their limits in pursuit of excellence.

While this list focuses on broader themes rather than individual performances, Simmons gives a performance of a lifetime as a perfectionist jazz teacher. The film delivers a crucial message: perfection comes at a cost. It breaks you in ways unimaginable at the start, but the view from that mythical peak is one most of us can only dream of—and in the climax, the film gives us a small taste of it.

Keywords: Perfection, Excellence, Toxic Relations, Teacher

5. Princess Mononoke (Japan)

And finally, we arrive at a Studio Ghibli film. Miyazaki’s movies are literal works of art, with every frame painstakingly hand-drawn by animators. Princess Mononoke stands as a testament to Hollywood and the world at large on the power of strong female leads and leadership. The film evokes nostalgia for pre-industrial Japan—a time of harmony between people and nature—while serving as a stark reminder of how the relentless march of industrialisation has left us increasingly desolate and disconnected.

Beyond the eponymous Princess Mononoke, the character of Lady Eboshi is perhaps my favourite grey character in any film. She is strong and independent, offering refuge to society’s outcasts—former prostitutes and people with leprosy—yet she is also the ruthless leader of Irontown. Her character serves as a reminder of the difficult balance a just leader must strike in times of conflict.

Every scene in this film feels like a stroll through an art gallery—an absolute masterpiece and a true delight to watch!

Keywords: Nature, Greed, Pacifism, Industrialisation

6. Mother (Korea)

We arrive at Bong Joon-ho, who, in my opinion, is one of the greatest living directors. Mother is an oft-overlooked masterpiece that I personally consider his best. It is also one of his more complex films, layered with multiple underlying themes. At its core, it portrays a disconnected society—one where the murdered body of a young girl lies in broad daylight, yet, because of her social status, no one looks closely or intervenes.

As the title suggests, the film is ultimately about a mother’s love—the lengths to which she will go to protect and care for her child. It also sheds light on the bullying and mistreatment children endure in their daily lives. However, in classic Bong Joon-ho fashion, these plot threads are twisted in unexpected ways, culminating in a devastating climax.

By the end, you are left conflicted, confronting one of my favourite themes: what is truth? Are we truly seeing it, or are we merely looking through a blurred window?

Keywords: Truth, Mother, Disconnected society, Single Parenthood, Blind Love

7. Spirited Away (Japan)

Widely regarded as Miyazaki’s masterpiece and the film that introduced many Western audiences to the magic of Studio Ghibli, Spirited Away is a work of unparalleled beauty. Its themes are too numerous to outline fully, but for me, the most striking is the innocence of childhood. Even today, when I watch Spirited Away, I find myself connecting with Chihiro—her mixture of awe and fear as she navigates an unfamiliar world resonates deeply.

The film powerfully explores separation anxiety and the pain of losing loved ones. In fact, the more I reflect on it, Spirited Away feels like a modern existentialist work. It aligns with Sartre’s notion that “existence precedes essence”—a lifelong struggle to define who we are. Yet, unlike the bleak conclusions of some French existentialists, Miyazaki offers a more hopeful vision. He shows that friendship, courage, and kindness can help us navigate even the most unfathomable worlds. Above all, the film reminds us of the importance of treating every life form with compassion.

If you haven’t yet watched Spirited Away, I can only say—I envy you. You have the rare joy of discovering this masterpiece for the first time. And of course, I can’t resist sharing the iconic train scene:

Keywords: Separation Anxiety, Alienation, Bravery, Valiance, Belief in Oneself

8. Up (USA)

Of course, no list of moving films would be complete without a Pixar entry. Among Pixar’s many inspiring works, I personally consider Up the best (sorry, Wall-E and others!).

Although marketed as a children’s film, Up tackles profound and often painful themes right from the start. Within its opening moments, it addresses the struggles of a couple facing fertility challenges, followed shortly by the devastating loss of a partner and the heartbreak that follows. The short montage Married Life has already achieved cult status as a cinematic masterpiece—and incredibly, this is just the beginning of the film!

The rest of Up lives up to that emotional depth, exploring the disillusionment that comes from meeting our childhood heroes and realising their fallibility. Along the way, the characters, through kindness, loyalty, courage, and wit, learn one of life’s most powerful lessons: the importance of letting go. The film delivers one of the most poignant depictions of this theme, reminding us that it is never too late to embrace new adventures. It brings to mind the saying, “You can never cross the ocean until you have the courage to lose sight of the shore.”

Keywords: Letting go, Kindness, Loyalty, It is never too late, Dont meet your heroes

9. Three Billboards Outside Ebbing Missourie (USA)

Three Billboards Outside Ebbing, Missouri powerfully explores the theme of navigating rage. At its core, the film follows two unlikely companions on a shared journey—Frances McDormand’s and Sam Rockwell’s characters—both grappling with the loss of loved ones. One has lost her daughter, while the other lost his father at a young age. The film subtly reveals how these tragedies have transformed them into figures consumed by anger, resentful of the unfair hand they believe the world has dealt them. Yet, over time, they both come to understand the profound power of forgiveness and the peace it can bring.

Sam Rockwell’s character arc is one of the most compelling anti-hero transformations I’ve ever seen. A particularly moving moment is the series of letters from Chief Willoughby, in which he asks for forgiveness—not just for himself, but as an appeal for others to embrace forgiveness and find inner peace.

Another central theme is the elusive nature of justice. As Willoughby profoundly reflects in one of his letters: “What you need to become a detective is love. Because through love comes calm, and through calm comes thought. And you need thought to detect stuff sometimes…” With stellar performances and a powerful narrative, Three Billboards Outside Ebbing, Missouri stays with you long after the credits roll.

Keywords: Forgiveness, Love, Kindness, Justice, Rage

10. The Wailing (Korea)

Another Korean masterpiece, The Wailing evokes horror without relying on jump scares or other traditional horror tropes. It is a richly layered film, filled with subliminal messages so intricate that capturing them all is nearly impossible. Themes such as the generational divide, the struggle between tradition and modernity, xenophobia, and the tension between Christianity and traditional Korean and Japanese shamanism run deep throughout the story.

At its core, the film explores how mistrust within an isolated community—fueled by prejudice—can lead to societal collapse, ultimately allowing a symbolic Satan to take hold. Even years after its release, audiences continue to analyse its symbolism, debate its ambiguous climax, and grapple with the question of who is truly good or evil. I highly recommend checking out multiple analyses of the film—it’s a mystery that keeps unfolding with every rewatch.

Keywords: Superstition, Mistrust

11. Tumbbad (India)

In a single sentence, Tumbbad is the ultimate personification of greed. And in 2025, as we witness those in power shaping policies under the guise of good intentions and the greater good, Tumbbad lays bare what truly lurks beneath the surface. It’s unlikely that any follow-up film could so vividly capture and embody the essence of greed and its transformative grip on a person.

Keyword: Greed

12. Parasite (Korea)

The first and only non-English film to win an Oscar for Best Picture, Parasite is perhaps Bong Joon-Ho’s most well-known film in recent years. Much has already been said and written about this masterpiece, where nearly every scene serves as a symbol of economic disparity between the wealthy and the working class in Korea. For one family, the rain is a beautiful spectacle; for another, it brings devastation as they struggle to save their home. The film masterfully portrays this divide through its literal uptown-downtown contrast. Parasite is probably the most commercial and easily accessible film on this list.

Keywords: Class Struggle, Inequality

13. Swades (India)

Swades is the most personal film on this list. I don’t think I can do justice to everything it represents in just a few lines—maybe a full blog post or even an academic paper would be more fitting. Beyond being an emotional journey into the heartland of India, Swades is also a cinematically brilliant film.

Mohan (an allusion to Mohandas Karamchand Gandhi) and Kaveri Ammi (symbolic of the river Cauvery) have long been separated, yet his inner guilt and sense of duty pull him back to her village. Most intriguingly, the film presents two versions of Mohan. When he first arrives, he is not the typical caricature of an NRI disconnected from India. He knows the country well—its Constitution, its major rivers, its politics, its corruption, its strengths, and its flaws. I personally related the most to this version of him. But then something happens, and we see a transformation. This time, Mohan doesn’t just know—he understands. He witnesses firsthand the inequality and systemic casteism tearing India apart and realises that to make a difference, he must be much more than who he was when he arrived.

It’s the difference between knowledge and wisdom, much like the one Hermann Hesse speaks about in Siddhartha. I highly recommend watching this film—but preferably under my supervision, so I can relentlessly comment on every frame and dissect every hidden meaning along the way :) .

Keywords: India, Caste system, Inequality, Gandhism, Connecting to your roots

14. The Shawshank Redemption (USA)

Thank you, Stephen King!

Keyword: Hope

15. Agantuk (India)

We arrive at the maestro—Satyajit Ray. Agantuk (English: The Stranger) is set in 1991 Bengal, a pivotal year marked by India’s liberalisation. The film portrays a middle-class nuclear family grappling with distrust when a mysterious stranger arrives at their doorstep, claiming to be a long-lost relative. However, this stranger unsettles their deeply held beliefs, challenging the very foundations upon which their society stands.

Through thought-provoking discussions, the film raises questions about the Western definition of “civilisation.” The uncle expresses regret that, shaped by Marx, Freud, and even Rabindranath Tagore, he has become entangled in the pursuit of being “civilised” while fearing the Western construct of the “savage”. His love for indigenous cultures and his fear of becoming a kupomanduk (a frog in a well) are understood only by the young protégé—a reflection of Ray’s brilliance in capturing the depth and sensitivity of children on screen.

Agantuk is Ray’s final film, and while his later works are often considered weaker due to their emphasis on dialogue over imagery, I believe this is one of the most accessible entry points into his cinematic world. I leave you with this iconic clip from Agantuk on civilisation.

Keywords: Identity, Materialism, Civilisation

16. Howl’s Moving Castle (Japan)

The third Miyazaki film on this list, Howl’s Moving Castle, primarily explores themes of vanity and outward appearance. It reflects our tendency to judge a book by its cover, yet in Howl’s, nothing is as it seems—old people appear young, young people appear old, a prince takes the form of a scarecrow, and unlikely heroes seem evil. As in many of Miyazaki’s works, the film features a strong female lead who, with quiet confidence, authenticity, and kindness, remains unfazed by outward appearances. Instead of being deceived by illusions, she embarks on a journey to help both herself and those around her. Beneath its fantasy elements, the movie delivers a profound message about self-awareness and the power of staying true to oneself.

Keywords: Idenity, Vanity, Pacifism, Authenticity

17. Coco (USA)

As C.S. Lewis put it, “A children’s story that can only be enjoyed by children is not a good children’s story.” In that spirit, Coco, though more of a children’s film than Up, is one that any adult can appreciate. The overarching theme is “Don’t meet your heroes”, as those we idolise for their perfection and infallibility may, in reality, be deeply flawed and more susceptible to corruption than we ever imagined. Along the way, the film offers a breathtaking visual spectacle that makes Coco an unforgettable experience.

Keywords: DONT MEET YOUR HEROES!

18. The Iron Giant (USA)

Another children’s film, The Iron Giant, explores our fear of the unknown—a theme reminiscent of the AI taking over trope. Set during the Cold War, with tensions between the U.S. and the Soviet Union at their peak, the film follows the eponymous giant, a being created as a weapon. Yet, through the kindness of a young boy, he learns that we are not bound by our origins—that we can choose who we become, no matter how we were made.

Keywords: Kindness, Fear of Unknown

19. Sairat (India)

Please avoid the many remakes and stick to the original 2016 Marathi film, Sairat. Director Nagraj Manjule casts a revealing eye on the regressive laws and institutionalised norms of the caste system that shape the lives of men and women in cross-caste relationships. Watching Sairat felt as though the director had captured an entire TV series within the film’s runtime. If you decide to watch it, I highly recommend going in without reading any detailed analyses. Since seeing this film, I have become an ardent admirer of Manjule’s work, and I leave you with this clip where he delivers a stellar response to why he isn’t making movies like Godzilla or King Kong in India.

Keywords: Class Struggle, Caste System

20. Rang De Basanti (India)

Lastly, I conclude my list with Rang De Basanti, a once-in-a-generation film that challenged both millennials and the generations before them in their perception of a commercial Hindi movie with a message. Rang De Basanti features one of the best anti-hero character arcs in recent memory—Laxman Pandey, whose steadfast belief in the ideals of Hindutva turns to heartbreak as he sees the hollow shell of the movement and what it truly stands for, followed by his rebellion for what he believes is a just cause. One must also note the ingenious device of comparing each character’s arc with their corresponding counterpart from the Indian freedom struggle.

The film addresses the generational divide about the teenagers and the twenty somethings during the freedom revolution of India, how they thought and worked and how disconnected the youth today is from the youth then. But somewhere when an oppressing force casts us and lays bare our society with corruption, the youth have the fire within them to stand up and realise something which they didnt know lives within them. As written by Prasoon Joshi “अभी-अभी हुआ यक़ीन की आग है मुझमें कही” (Translation: “Only now did I believe that there lives a fire inside me somewhere”)

Keywords: Youth, Rebellion

Square Limit by M.C. Escher

2024-11-04T00:00:00+00:00

This is a digital rendition of M.C. Escher’s square limit artwork. The most surprising thing about this piece is that it is composed of only four basic tiles shown below:

The entirety of this art is generated by taking these four tiles and using the power of Functional Programming à la function composition to create several powerful combinators that can render the infinite shapes. This technique was illustrated by Peter Henderson in his 1982 seminal paper - Functional Geometry. The paper mentions Mary Sheeran (who happened to be my PhD supervisor!) as the original implementor in UCSD Pascal. I have implemented the same in Haskell using the wonderful diagrams library.

The code intentionally avoids complicated rasterization and related techniques from computer graphics, so that the simple (yet powerful) essence of functional geometry is evident at a glance. The heart of the logic is this:

squareLimit :: Diagram B
squareLimit = cycle corner

corner = nonet corner2 side2 side2
               (rot side2) uTile (rot tTile)
               (rot side2) (rot tTile) (rot qTile)

nonet p1 p2 p3 p4 p5 p6 p7 p8 p9
  = onTopOf 1 2  (nextTo 1 2 p1 (nextTo 1 1 p2 p3))
   (onTopOf 1 1  (nextTo 1 2 p4 (nextTo 1 1 p5 p6))
                 (nextTo 1 2 p7 (nextTo 1 1 p8 p9)))

nextTo m n d1 d2 = scaledD1 ||| scaledD2
  where
    scaledD1 = d1 # scaleX (m / total)
    scaledD2 = d2 # scaleX (n / total)
    total = m + n

onTopOf m n d1 d2 = scaledD1 === scaledD2
  where
    scaledD1 = d1 # scaleY (m / total)
    scaledD2 = d2 # scaleY (n / total)
    total = m + n

side2   = quartet side1 side1 (rot tTile) tTile
corner2 = quartet corner1 side1 (rot side1) uTile

uTile = cycle $ rot qTile
tTile = quartet pTile qTile rTile sTile

cycle p1 = quartet p1 (rot $ rot $ rot p1) (rot p1) (rot $ rot p1)

quartet p q r s = scale (1/2) $ centerXY ((p ||| q) === (r ||| s))

rot p = rotate (90 @@ deg) p

pTile = makeTile markingsP
qTile = makeTile markingsQ
rTile = makeTile markingsR
sTile = makeTile markingsS

The complete executable code is available here. To the best of my knowledge, this is one of the best illustrations of the power of function composition to create complex software. There is not a single wasted line, and the essence of the code is as clear and declarative as possible.

Solving Monty Hall with the List monad (or Probabilistic Programming)

2024-07-25T00:00:00+00:00

Continuing from my Bayes Theorem-based blog post on solving the Monty-Hall problem, I will now show a rather elegant way to model the solution that exploits the humble List monad. To remind the reader, the Monty Hall problem involves a contestant initially choosing one of three doors, hoping to find a car behind it. Monty Hall then opens a different door, revealing a goat, and asks the contestant if they want to switch their initial choice. The goal of the solution is to guide the contestants to make this switching choice such that they have a higher probability of winning. This post was inspired by a less-than 100 lines implementation of a probabilistic programming monad in Haskell

We will begin by representing discrete distributions as a Haskell datatype:

type Prob = Double -- probability of an outcome

newtype Dist a = Dist { unpackDist :: [(a, Prob)] }

This is a nicer reformulation of distributions that the List monad could naturally model. For example, in an experiment with 5 tosses (modelled as data Toss = Head | Tail), that leads to 4 Heads and 1 Tail, our Dist type will model this as Dist [(Head, 0.8), (Tail, 0.2)]. However, we could have modelled this as a plain list as [Head, Head, Head, Head, Tail]. It is simply that the Dist type is nicer to work with (but it is a wrapper on the plain List).

Next, to compute probabilities accurately, it is easier for our representation if we normalise the distribution. We can do that using:

normP :: [(a, Prob)] -> [(a, Prob)]
normP xs = map (\(x, y) -> (x, y / (sumP xs))) xs
  where
    sumP :: [(a, Prob)] -> Prob
    sumP = sum . map snd

Given the above, we can now define the uniform distribution:

uniform :: [a] -> Dist a
uniform = Dist . normP . map (, 1.0)

> uniform [Head, Head, Head, Head, Tail]
Head | 0.8000
Tail | 0.2000

The Show instance of Dist ensures that it sums up the probabilities of the same outcome, giving the above. We don’t show the Show instance here, that and other useful probabilistic combinators are presented in this blog post.

Monad instance

The Monad instance of Dist is the most important construct for modelling Monty Hall. We will use the Monad instance to model conditional discrete distribution. Given two discrete random variables X and Y, we can compute their joint distribution (given that they are not mutually independent) as follows:

\[Pr({X = x} \cap {Y = y}) = Pr(X = x | Y = y) . Pr (Y = y)\]

We can convert this idea to the Monad instance quite naturally:

instance Monad Dist where
  (Dist xs) >>= f = Dist $ do
    (x, p)  <- xs
    (y, p') <- unpackDist (f x)
    return (y, p * p')

The p * p' is multiplying the probabilities of the two outcomes. The join function of the Monad instance can allow us to consider an alternate view of the above. Considering a distribution of distributions (please don’t try to visualise), we want to flatten it into a single distribution as follows:

join :: Dist (Dist a) -> Dist a
join (Dist dist) = Dist $ do -- dist  :: [(Dist a, Prob)]
  (Dist dista, prob) <- dist -- dista :: [(a, Prob)]
  (a, prob2)         <- dista
  return (a, prob * prob2)

Monty Hall

Armed with this small set of combinators above, we now model the Monty Hall problem. We start by defining the outcome type:

data Outcome = Win | Loss deriving (Show, Eq, Ord)

And now, let’s intuitively specify the switching strategy : If our first choice is the winning door, followed by which Monty Hall opens a door that contains a goat, if we switch, then we will certainly lose. That is because we already selected the winning door, and the switch will cost us the victory. However, if our first choice is the losing door, followed by Monty Hall opening the door containing a goat, if we switch, then we certainly win. Because we chose the losing door, Monty Hall opened the other losing door and the remaining door certainly assures our victory.

In the specification above we used the language of certain victory or certain loss, so we need to model certainty. And that is modelled as:

certainly :: a -> Dist a
certainly a = Dist [(a, 1.0)]

So, one outcome that has a probability of 1.0. And, with that, we can easily translate the English specification above, to the Haskell snippet below:

switching :: Dist Outcome
switching = do
  firstChoice <- uniform [Win,Loss,Loss]
  if (firstChoice == Win)
  then {- switching will -} certainly Loss
  else {- switching will -} certainly Win

The comment {- switching will -} is added such that the specification almost translates to literal English. Now, we can observe the distributions:

> switching
 Win | 0.6667
Loss | 0.3333

This shows us that switching our choice will allow us to win 66% of the time while losing 33% of the time. The first choice has the following distributions:

> uniform [Win, Loss, Loss]
 Win | 0.3333
Loss | 0.3333
Loss | 0.3333

Our computation proceeds by certainly losing if we switch after winning the first round, while certainly winning if we switch after losing the first round. Hence the calculation proceeds as:

 Win ~> certainly Loss| 0.3333 * 1.0
Loss ~> certainly Win | 0.3333 * 1.0
Loss ~> certainly Win | 0.3333 * 1.0

==>

Loss | 0.3333
Win  | 0.3333
Win  | 0.3333

==>

 Win | 0.6667
Loss | 0.3333

Hence, the contestant should switch!

Source code for the above is available at Abhiroop/bayes

The Monty Hall Problem and Bayes Theorem

2024-06-28T00:00:00+00:00

Today, while reading Steven Pinker’s latest book on critical thinking, Rationality, I was scribbling a solution to the Monty Hall problem. The Monty Hall problem is probably one of the most well-studied pop-science problem statement with different intuitive explanations of the solution. In case, dear reader, you have been living under a rock here is the problem statement:

Setup: A game show where you as a contestant have to guess to win a prize. The game comprises three doors. Behind one door is a car (the prize), and behind the other two doors are goats.

Initial Choice: You choose one of the three doors.

Host’s Action: The host, Monty Hall, who knows what is behind each door, opens one of the remaining two doors, revealing a goat.

Decision Point: You are then allowed to stick with your original choice or switch to the other unopened door.

Question: How should you maximise your chances of winning the car?

Now, somehow through various coffee table discussions, I always knew the answer was that the contestant should switch their choice. So many times I have had this encounter that I realised I never bothered to calculate or question my intuition on why the contestant should switch. However, the book informed me that even some of the leading mathematicians and statisticians of the era, including the legendary Paul Erdős, were stumped by the answer to the problem. Erdős was convinced of the solution only after he witnessed simulations rather than a proof or calculation!

So, I decided to reuse my ninth-standard probability lessons of Bayes Theorem to craft a solution. I have not at this point read through any solution, except the one that Pinker provides in his book. Pinker’s solution (I am sure this particular solution was not Pinker’s proposal but his explanation of a known solution) is quite different and uses a gameplay simulation and informal intuitions to drive home the result (and I think his explanation works). The Goodreads reviews particularly point out that and appreciate Pinker’s lucid and intuitive explanation. If you are interested in reading more about his solution, I encourage you to purchase the book. However, I will use an established result of the Bayes theorem of conditional probability, which states:

\[Pr(A|B) = \frac{Pr(B|A) . Pr (A)}{Pr(B)}\]

I have written enough PL papers that I am itching to start explaining the notation and domains, but it is 3:25 am and I am hoping that the reader can follow along with a basic ninth standard notation. If not, please jump to the wiki article on Bayes Theorem.

Now, let’s consider the Monty Hall problem and assume that among the three doors, the contestant chooses the first one. Seeing this, Monty Hall opens the third door to reveal a goat. Then, Monty Hall asks the contestant if they want to switch. Let me draw the scenario at this point

   ---         ---         ---
  | C |       |   |       | M |
  Door 1      Door 2      Door 3

The C on Door 1 marks that the contestant chooses the door and the M on Door 3 marks that Monty Hall opened it to reveal a goat. The contestant now needs to maximise their probability of winning a car. Should they stick with Door 1 or switch to Door 2? In effect, they need to calculate:

\[Pr(car\ in\ 2|my\ choice\ 1,\ monty\ opened\ 3) = \frac{Pr(my\ choice\ 1,\ monty\ opened\ 3|car\ in\ 2) . Pr (car\ in\ 2)}{Pr(my\ choice\ 1,\ monty\ opened\ 3)}\]

Let’s calculate the probabilities on the right side. Pr(car in 2) is the simplest one. Given that it is not constrained by any conditions, the probability that the car is in one of the three doors (in this case door 2) is simply 1/3

Next, Pr(my choice 1, monty opened 3|car in 2). This one is interesting. Given that the car is in door 2, what is the probability that Monty Hall opened door 3, while the contestant chose door 1? This one is interesting because the problem statement makes certain assumptions - (1) Monty Hall actually knows where the car is, (2) Monty Hall wants the show to last at least two rounds. Holding assumption 2, Monty Hall will intentionally open a door with a goat such that the show can progress to the next round. Given that, in our conditional probability scenario, Monty can never open door 2, because the car will be revealed (remember assumption 1 that Monty knows where the car is) and the show won’t progress to the next round. Similarly, Monty Hall can never open door 1, because the contestant has already made door 1 their choice. If door 1 houses a goat or a car, in both cases assumption 2 is invalidated and the show abruptly ends. Hence, the only choice that Monty Hall has is to open Door 3. So, the overall conditional probability Pr(my choice 1, monty opened 3|car in 2) is in fact 1! Door 3 is the only choice Monty Hall has.

Finally, Pr(my choice 1, monty opened 3). This is a weaker version of the previous probability, where the condition car in 2 is no longer in place. So, now Monty Hall cannot open door 1 because the contestant has already chosen door 1 and opening door 1 will abruptly end the show (as explained earlier). Now, the probability of Monty opening door 3 is 50% or 1/2 because the constraint of the car is gone, and Monty Hall needs to choose 1 out of 2 doors (Doors 2 and 3).

With the above, we will have:

\[Pr(car\ in\ 2|my\ choice\ 1,\ monty\ opened\ 3) = \frac{Pr(my\ choice\ 1,\ monty\ opened\ 3|car\ in\ 2) . Pr (car\ in\ 2)}{Pr(my\ choice\ 1,\ monty\ opened\ 3)}\] \[Pr(car\ in\ 2|my\ choice\ 1,\ monty\ opened\ 3) = \frac{1 . \frac{1}{3}}{\frac{1}{2}}\] \[Pr(car\ in\ 2|my\ choice\ 1,\ monty\ opened\ 3) = \frac{2}{3}\]

Equivalently, we can calculate the probability of the car being behind Door 1 as simply 1 - 2/3 = 1/3. Hence, the contestant should switch, as they have a 66.67% probability of the car being behind Door 2.

Given that Steven Pinker’s book is about thinking styles, I realised that I attempt to approach most problems quite mechanically. I observed this pattern while solving the other puzzles in the book, where I typically try to abstract the problem into a logical/math domain and apply established/known results to answer the question, rather than using physical intuition. Quiet often this style is described as dry and procedural, as opposed to Pinker’s lucid and intuitive style, but there is sometimes merit in dryness.

Mathematics is a game played according to certain simple rules with meaningless marks on paper - David Hilbert

Overleaf submission to arXiV

2024-01-17T00:00:00+00:00

After scouring through the entire web for close to 5 years and giving up out of sheer frustration every time, I was finally able to upload my TeX sources to arXiv (as of 17.01.2024). The solution was buried as a comment to one of the lesser upvoted answers in a less popular StackExchange thread. I hope Google will push this post high up so that it can help Overleaf (and minted) users.

Compile with \usepackage[finalizecache,cachedir=.]{minted}
Go to logs and output files > other logs and files and download everything with pyg. (all the *.pygtex and the *.pygfile)
Compile now with \usepackage[frozencache,cachedir=.]{minted}
Download the complete project .zip by submitting it to arXiV
Unzip and put all the *.pygtex files and the *.pygfile into the folder
Zip again and upload to arXiv; Works for me!

Answer source (User: rhombidodecahedron)

Opinion piece on Capitalism and Communism

2021-07-19T00:00:00+00:00

Recently in a private discussion forum, there was a topic under consideration which went something like this:

“Capitalism is pregnant with communism. The next country to become a communist state will be the United States.”

I wrote a slightly descriptive answer to this and feel the response might be worth sharing. I am simply copy-pasting my answer from the forum verbatim below.

“Capitalism is pregnant with communism. The next country to become a communist state will be the United States.”
Any thoughts on it?

Sadly, I would have to (mostly) disagree with both of these statements. Especially I believe if any of the nations were to approach a Communist-style of governance, the US will most likely be the last country to adopt it in its entirety. But this thought is based on assuming that the author is talking about a purely Marxist brand of communism.

Would America have state-controlled production? I cannot see that in the near future. On the other hand, if the implication was that there will be influences of certain ideas born and bred in the Communist and Socialist schools, well they are already there in place. For eg, the US has antitrust laws in place which are fundamentally incompatible with a pure laissez-faire capitalist system. An example of it being used in action is the dissolution of the AT&T corporation in 1982[1] owing to it becoming a monopoly in telephone services.

On a much smaller scale, an example of a centralised control regulating a free market is the recent fiasco with the Gamestop stock price skyrocketing and then Robinhood delisting the stocks from being traded. There were lots of complaints and criticism towards Robinhood broadly for:

not allowing the free market to thrive and
decapitating small individual investment parties.

and Robinhood was labelled as a giant capitalist bully. The irony here is that if you read these two points again and carefully, you will realise that most Marxist schools strive exactly for outcome 1 and partially for outcome 2. So it would seem that the lines between the two ideologies are slightly blurred in the more modern contexts.

Marx has criticised the liberty of an individual within a bourgeois society[2] as “the liberty of man viewed as an isolated monad, withdrawn into himself. […] The practical application of the right of liberty is the right of private property”. While phrased very sharply, this could be a slippery slope. In most implementations of Marxist communism, the value of an individual is severely diminished at the cost of the community (or society or state). Now anybody who has remotely been engaged with any form of creative expression (like art, music, poetry, novels, cinema) will recognize the dangers of diminishing the power of an individual.

On the other hand unchecked capitalism, as perhaps we all know, can lead to the commodification of, well, everything! Furthermore, perhaps, the commodification of an individual as well! This is partly the reason that antitrust laws exist in even the most neoliberal countries (like the US and UK). This is why corporations like Amazon, Microsoft, Oracle get repeatedly being hit by lawsuits (especially in the more left-leaning Europe).

While I personally do not understand economics very well and work very little towards bridging that gap (mostly because my topic of PhD is a very different subject :)), I do keep an active interest in education and wonder what the outcomes of Capitalistic and Communist policies are (or would be) on our Universities and academic institutions. The American model of private universities has led to a huge student-loan debt in that country and I can see the flurry of private universities rising in India following that same path. On the other hand, European countries seemed to have done surprisingly well with their state institutions, with free education and unimaginable policies like paying the students instead to attend colleges (Sweden).

I wonder what needs to be done in India to emulate the Europen model. It is a complex question. With such a vast population of youth in our country (isn’t India’s average age close to 25?) we want education to spread fast. If we over-rely on the European model of prestigious state institutions, it gives rise to a select class of elite institutes like the IITs - which are notoriously hard to get into and the entrance exams cause an unprecedented amount of mental turmoil within the youth. If on the other hand, we promote private universities within a free market economy, we inevitably gravitate towards the American model and its outcome of insurmountable student loans.

It seems, under all circumstances, the mass of students suffers. How do we cope with this? What is the ideal solution? I have no answers but only questions.

[1] Breakup of the Bell System
[2] Marx, Karl. Karl Marx: selected writings.

Unfolding Fix

2019-12-20T00:00:00+00:00

In pure lambda calculus (untyped or typed) the fix combinator is used to encode recursion. fix represents the least fixed point of a function f i.e the least defined x for which f x = x. An important concept is that a function need not have a least fixed point. But for those functions which do, the least fixed point is the base case for recursion. An intuition about the least fixed point is :

f (f (f ( f x))) = x

This is a convenient way to represent recursion in a language which doesn’t support naming functions. The base case of the recursion is determined by the actual least point of the function. Here we will take the most popular example of encoding fix which is a factorial function :

factorial n = if n <= 1 
              then 1
              else n * factorial (n - 1)

and demonstrate the stack trace of how the fix function unfolds this recursion. First lets define fix in Haskell.

fix :: (a -> a) -> a
fix f = f (fix f)

Owing to the laziness of Haskell we can define such an expression which doesn’t fully evaluate the fix f call in the right hand side but rather creates a thunk.

Now writing the factorial function using fix :

factorial = fix (\fac n -> 
                    if n <= 1 
                    then 1 
                    else n * fac (n-1))

factorial x = 
  (fix (\fac n -> 
          if n <= 1 
          then 1 
          else n * fac (n-1))) x

Lets try calculating factorial 3

Unfolding:

(f (fix f)) 3

-- substituting f

((\fac n -> if n <= 1 
            then 1 
            else n * fac (n-1)) 
            (fix (\fac n -> 
                    if n <= 1 
                    then 1 
                    else n * fac (n-1)))) 
3

-- beta reduction

(\n -> if n <= 1
       then 1
       else n * ((fix (\fac n -> 
                         if n <= 1 
                         then 1 
                         else n * fac (n-1))) (n - 1))
) 3

-- beta reduction

if 3 <= 1
then 1
else 3 * ((fix (\fac n -> 
                 if n <= 1 
                 then 1 
                 else n * fac (n-1))) 2)

-- evaluating if

3 * ((fix (\fac n -> 
                 if n <= 1 
                 then 1 
                 else n * fac (n-1))) 2)

-- substituting f

3 * 
(\fac n -> if n <= 1 
           then 1 
           else n * fac (n-1)) 
           (fix (\fac n -> 
                   if n <= 1 
                   then 1 
                   else n * fac (n-1))) 
 2)

-- beta reduction

3 *
((\n -> if n <= 1
       then 1
       else n * (fix (\fac n -> 
                        if n <= 1 
                        then 1 
                        else n * fac (n-1)) (n - 1))) 2)

-- beta reduction

3 *
if 2 <= 1
then 1
else 2 * (fix (\fac n -> 
                 if n <= 1 
                 then 1 
                 else n * fac (n-1)) 1)

-- evaluating if

3 * 2 * (fix (\fac n -> 
                if n <= 1 
                then 1 
                else n * fac (n-1)) 1)

-- substituting f

3 * 2 *
(((\fac n -> 
     if n <= 1 
     then 1 
     else n * fac (n-1)) 
     (fix (\fac n -> 
             if n <= 1 
             then 1 
             else n * fac (n-1)))) 1)

-- beta reduction

3 * 2 *
((\n -> if n <= 1
       then 1
       else n * ((fix (\fac n -> 
                         if n <= 1 
                         then 1 
                         else n * fac (n-1)) ) (n - 1))) 1)

-- beta reduction

3 * 2 * 
if 1 <= 1
then 1 -- hit the base case
else n * ((fix (\fac n -> 
                  if n <= 1 
                  then 1 
                  else n * fac (n-1)) ) (n - 1)))

-- evaluating if

3 * 2 * 1 = 6

Persistent Red Black Trees in Haskell

2017-10-28T00:00:00+00:00

While Haskell is steadily gaining mainstream adoption in the industry, it still remains one of the most viable languages used as a teaching medium. Even if we strip the fancy type level features in Haskell, algebraic data types and pattern matching are quite expressive enough to represent a lot of ideas.

In this post I will be looking at the construction and operations of Red Black trees. Of special interest here would be the deletion of nodes, as the operation of delete is inherently opposed to Haskell’s fundamentals of immutability. We will look at these various operations as creating a new version of the tree rather than mutating an existing version, which gives the persistent quality to our data structure.

This post doesn’t use any fancy type level features of Haskell(except at the very end) and assumes only basic familiarity with the language. However the deletion operation is quite involved and requires the reader to be fairly meticulous so as to not miss any of the possible cases.

A red black tree is a type of a binary search tree with the ability to self balance itself. This property of self balancing is highly desirable as a plain binary search tree reduces to O(n) worst case time complexity for search, insert and delete operations. The balancing nature of red black tree gives it a worst case time complexity of O(log n). It is not truly balanced, in the sense that the heights of the various subtrees can differ, but the height of the longest subtree would be a maximum of 2log(n+1), which effectively gives it a balanced nature.

From a practical perspective, it is not expected of you to code out a fully functional red black tree every other day, but it is helpful to know the implementations and understand the trade-offs made so that performance analysis on some critical code can be made.

Notably, TreeMap from the Java collections library is implemented using a Red Black Tree(however it is not a persistent implementation). In practice, most implementations of maps, sets and other useful structures are implemented using balanced binary search trees. Hence without any further ado lets jump into the datatypes.

Defining the tree

We will start by defining 2 colors red and black and use that as a metadata in our actual tree type.

data Color = R | B deriving Show

data Tree a = E | T Color (Tree a) a (Tree a) deriving Show

Now for a red black tree to be balanced it needs to follow a set of invariants. These invariants technically can be encoded into the Haskell type system but to keep the implementation simpler we define it using functions and verify them at runtime. The invariants are:

No red node has a red parent or a red node has only black children.
Every path from the root node to an empty node contains the same number of black nodes.
The root and leaves of the tree are black.

Take some time to internalize these invariants, as they should not be broken under any circumstances. Throughout the rest of the article, these invariants are referred to a number of times by just referring to their serial number. We can relax some of them locally in certain cases but we make sure some other functions at the global level handles the violation and rectifies it. We are going to start by implementing the member function, which basically tells us if the element belongs to the tree or not. This is not very different from a regular BST.

member :: (Ord a) => a -> Tree a -> Bool
member x E    = False
member x (T _ a y b)
  | x < y     = member x a
  | x == y    = True
  | otherwise = member x b

This should be self explanatory. If we find the element we report True otherwise we recurse down the left or right subtree.

Insertion

Now lets look at one of the more involved operations: insert.

The insertion operation was described by Chris Okasaki in his classic book “Purely Functional Data Structures”. The insertion of a node is colored red in the beginning so as not to affect the height. However this might end up violating the first invariant. To restore the first invariant we have to balance the tree recursively. The function looks something like this:

insert :: (Ord a) => a -> Tree a -> Tree a
insert x s = makeBlack $ ins s
  where ins E  = T R E x E
        ins (T color a y b)
          | x < y  = balance color (ins a) y b
          | x == y = T color a y b
          | x > y  = balance color a y (ins b)
        makeBlack (T _ a y b) = T B a y b

The points to notice here in this definition are the functions makeBlack and balance. Minus those functions the insertion is exactly similar to an insert in a BST. The makeBlack function is also a fairly simple function, given a Node it changes the color of the node to black irrespective of the node’s color. The purpose of this function is that, after applying balance recursively the final tree might have 2 consecutive red nodes at the top of the tree. This would be a violation of invariant 1. By blackening the root we restore invariant 1 as well as we invariant 2 stays intact, as only the root of the tree gets colored. A more apt name for the function, perhaps should have been blackenRoot. However this is quite simple.

The trick is now writing and understanding the balance function. Now we know that the insertion might have violated invariant 1 and as a result of which it might have created a tree with 2 consecutive red nodes. So all we have to think is, given the original balanced tree with no violations, what are the possible ways in which a red node might have been inserted which breaks the invariant 1. Let us see: (Due to lack of a red pen I am using a blue pen to represent red nodes. So this is technically a blue black tree. But you get the point):

Now algebraic data types and pattern matching makes it very easy to express each case. So if we write out the tree structure as demonstrated in the figure, the 4 cases would look like this:

T B (T R (T R a x b) y c) z d  
T B (T R a x (T R b y c)) z d  
T B a x (T R (T R b y c) z d)  
T B a x (T R b y (T R c z d)) 

So all we have to do is find a way to balance each of the cases. As it is given in the figure, the solution to balancing each of the 4 cases is the exact same. By using this transformation, the tree restores the invariant 1. And invariant 2 stays intact too(Invariant 3 is almost always intact and easiest to enforce). So what happens for the other configurations of the tree? We return the nodes intact. So writing the same in Haskell:

balance :: Color -> Tree a -> a -> Tree a -> Tree a
balance B (T R (T R a x b) y c) z d = T R (T B a x b) y (T B c z d)
balance B (T R a x (T R b y c)) z d = T R (T B a x b) y (T B c z d)
balance B a x (T R (T R b y c) z d) = T R (T B a x b) y (T B c z d)
balance B a x (T R b y (T R c z d)) = T R (T B a x b) y (T B c z d)
balance color a x b = T color a x b

Languages like OCaml and SML support something called OR patterns which make it even simpler to write the function definition where multiple patterns have the same answer. And in fact there is a GHC proposal in progress about adding OR patterns to Haskell. This implementation is much more readable, expressive and most importantly intuitive compared to any other imperative language implementations of the same idea. Writing a red black tree balancing algorithm is a big deal in such languages but with ADTs and pattern matching it literally begs to follow the diagram given above.

Deletion

Now moving on to the delete operation. This one is lot more involved and we will do it step by step.

First a deletion followed by any kind of balancing has the possibility of bubbling a red node to the top, so we need the makeBlack function that we used in the insert function, followed by that we can call an auxiliary del function which will effectively delete the node and balance the tree. In Haskell:

delete :: (Ord a) => a -> Tree a -> Tree a
delete x t = makeBlack $ del x t
  where makeBlack (T _ a y b) = T B a y b

This is fairly simple. Let us explore the del function in depth.

So, in case of the insert function, the balancing of the trees, conveniently unified into a single transformation but that is not the case for delete. The cases of balancing are different but symmetric to each other for the left and right subtrees and we have to handle the 2 cases separately. So we will declare 2 separate function delL and delR for the left and right subtree respectively. And what about the case when we actually arrive at the node which we want to delete? In that case we remove that node and fuse the left and right subtree together. We will look at the fuse function in detail at a later part of this article. So writing the del function:

del :: (Ord a) => a -> Tree a -> Tree a
del x t@(T _ l y r)
  | x < y = delL x t
  | x > y = delR x t
  | otherwise = fuse l r

which literally translates from what we described in the earlier paragraph. So now before delving into the delL and delR function let us try to think about the balancing first. So for the corresponding delL and delR function we will have a balL and balR function which balance the left and right subtrees respectively, when one is shorter than the other. The signature of balL and balR should be dead simple. Given an unbalanced tree it recolors and balances the trees and outputs a balanced tree.

balL :: Tree a -> Tree a

balR :: Tree a -> Tree a

Now as red nodes don’t contribute to the height of the tree and given invariant 1, that red nodes can only have black child, when a deletion of a node occurs, the violation of invariant 2 can happen and the left and right subtree might not be of equal height(Remember only black nodes contribute to the height of the tree.)

balL concerns the cases where deletion has occurred from the left subtree. Hence in balL we can assume that the left subtree is shorter than the right subtree. What are the possible cases?

Case 1: Root node is black and left subtree root is red

Coloring y red and x black we increase the height of left subtree by 1 and the height of the right subtree remains unchanged and hence it gets balanced. So translating the diagram to Haskell:

balL (T B (T R t1 x t2) y t3) = T R (T B t1 x t2) y t3

Case 2: Root node is black, left subtree root is black.

If the left subtree root is black we can’t touch the left tree as it is already shorter and altering the black node would shorten the height more. We need to look at the right subtree.

So depending on the color of the root node there are 2 subcases in this:

Case 2. i. Root node is black, right subtree root is black

Coloring z as red reduces the height of the right subtree by 1 but it might end up violating invariant 1. In which case we have to call the old balance function that we used in case of invariant 1 violation. We can define a simple helper balance' which takes the entire node instead of passing the left subtree, right subtree, color etc. So this branch becomes:

balL (T B t1 y (T B t2 z t3)) = balance' (T B t1 y (T R t2 z t3))

Case 2. ii. Root node is black, right subtree root is red..

This is the most involved case. Here after rebalancing the tree the only issue is t4’s height is still n+1 which can be resolved by coloring its root red. However we need to rebalance the subtree of (t3 z t4) to resolve any violations of invariant 1.

Hence the code translates to:

balL (T B t1 y (T R (T B t2 u t3) z t4@(T B l value r))) =
  T R (T B t1 y t2) u (balance' (T B t3 z (T R l value r)))

This concludes our cases for balL. And balR is exactly symmetric to the cases of balL, except now the right subtree would be shorter. I am adding the code for that part just for reference.

balR :: Tree a -> Tree a
balR (T B t1 y (T R t2 x t3)) = T R t1 y (T B t2 x t3)
balR (T B (T B t1 z t2) y t3) = balance' (T B (T R t1 z t2) y t3)
balR (T B (T R t1@(T B l value r) z (T B t2 u t3)) y t4) =
  T R (balance' (T B (T R l value r) z t2)) u (T B t3 y t4)

Now that we have balL and balR sorted we can start thinking about the delL and delR function. The signature of these functions are simple enough, given a node and tree it returns a tree with that node removed:

delL :: (Ord a) => a -> Tree a -> Tree a

delR :: (Ord a) => a -> Tree a -> Tree a

So if you look at the cases of balancing above, balancing is required only when there is a black root node, in case the node is red we just recurse down the path. So the red case is simple:

delL x t@(T R t1 y t2) = T R (del x t1) y t2

delR x t@(T R t1 y t2) = T R t1 y (del x t2)

Now in case the root node is black we just balance the entire tree after the delete:

delL x t@(T B t1 y t2) = balL $ T B (del x t1) y t2

delR x t@(T B t1 y t2) = balR $ T B t1 y (del x t2)

And thats all, these are the possible cases of delL and delR.

So moving to the final case of the deletion, which is fusing the 2 subtrees when the node is found.

The cases are really simple when the color of 2 roots are different i.e. black and red or red and black.

which again translates very easily to the code:

fuse t1@(T B _ _ _) (T R t3 y t4) = T R (fuse t1 t3) y t4
fuse (T R t1 x t2) t3@(T B _ _ _) = T R t1 x (fuse t2 t3)

The difficulty arises when the color of the roots are same.

Consider the case when both the roots are red:

The transformation above are captured in the code below:

fuse (T R t1 x t2) (T R t3 y t4)  =
  let s = fuse t2 t3
  in case s of
       (T R s1 z s2) -> (T R (T R t1 x s1) z (T R s2 y t4))
       (T B _ _ _)   -> (T R t1 x (T R s y t4))

Any violation of invariant 1 is handled by the balance functions at the upper layers of the recursion.

Similarly the case when both roots are black:

if the top node of s is black we need to use balL because the height of the right subtree has increased. And if it is red we follow the transformation given in the figure above:

fuse (T B t1 x t2) (T B t3 y t4)  =
  let s = fuse t2 t3
  in case s of
       (T R s1 z s2) -> (T R (T B t1 x s1) z (T B s2 y t4)) -- consfusing case
       (T B s1 z s2) -> balL (T B t1 x (T B s y t4))

Thats all. Putting together the entire code for delete we have:

delete :: (Ord a) => a -> Tree a -> Tree a
delete x t = makeBlack $ del x t
  where makeBlack (T _ a y b) = T B a y b
        makeBlack E           = E

del :: (Ord a) => a -> Tree a -> Tree a
del x t@(T _ l y r)
  | x < y = delL x t
  | x > y = delR x t
  | otherwise = fuse l r

delL :: (Ord a) => a -> Tree a -> Tree a
delL x t@(T B t1 y t2) = balL $ T B (del x t1) y t2
delL x t@(T R t1 y t2) = T R (del x t1) y t2

balL :: Tree a -> Tree a
balL (T B (T R t1 x t2) y t3) = T R (T B t1 x t2) y t3
balL (T B t1 y (T B t2 z t3)) = balance' (T B t1 y (T R t2 z t3))
balL (T B t1 y (T R (T B t2 u t3) z t4@(T B l value r))) =
  T R (T B t1 y t2) u (balance' (T B t3 z (T R l value r)))

delR :: (Ord a) => a -> Tree a -> Tree a
delR x t@(T B t1 y t2) = balR $ T B t1 y (del x t2)
delR x t@(T R t1 y t2) = T R t1 y (del x t2)

balR :: Tree a -> Tree a
balR (T B t1 y (T R t2 x t3)) = T R t1 y (T B t2 x t3)
balR (T B (T B t1 z t2) y t3) = balance' (T B (T R t1 z t2) y t3)
balR (T B (T R t1@(T B l value r) z (T B t2 u t3)) y t4) =
  T R (balance' (T B (T R l value r) z t2)) u (T B t3 y t4)

fuse :: Tree a -> Tree a -> Tree a
fuse E t = t
fuse t E = t
fuse t1@(T B _ _ _) (T R t3 y t4) = T R (fuse t1 t3) y t4
fuse (T R t1 x t2) t3@(T B _ _ _) = T R t1 x (fuse t2 t3)
fuse (T R t1 x t2) (T R t3 y t4)  =
  let s = fuse t2 t3
  in case s of
       (T R s1 z s2) -> (T R (T R t1 x s1) z (T R s2 y t4))
       (T B _ _ _)   -> (T R t1 x (T R s y t4))
fuse (T B t1 x t2) (T B t3 y t4)  =
  let s = fuse t2 t3
  in case s of
       (T R s1 z s2) -> (T R (T B t1 x s1) z (T B s2 y t4))
       (T B s1 z s2) -> balL (T B t1 x (T B s y t4))

The entire code is available here.

The above algorithm was first devised by Stefan Kahrs from University of Kent (Thanks to Dr. Venanzio Capretta for explaining the cases).There is an alternate red black deletion algorithm devised by Matt Might which uses auxiliary colors to simplify the cases. He has blogged about it in great detail here.

While the case of deletion is quite involved, if you take a look at the entire code for the red black tree, its hardly 100 lines of Haskell. And it is persistent in nature by default. It would be much more difficult designing a thread safe red black tree in any other imperative language. Most importantly, when teaching someone data structures for the first time, the syntax never intrudes in the way of the logic of the program. I have been working on this as part of a course on Advanced Data Structures and Algorithms that I am taking, and using Haskell has made understanding the logic dead simple.

Type Level Trees

In addition if we want, we can encode these invariants at the type level. Writing an entire type level Red Black Tree would require another post.

Stephanie Weirich from University of Pennsylvania has done lots of work in that area and there are many of her presentations available online.

As a small taste of what we can do, we can encode the simple invariant that “The difference in height between the 2 subtrees is maximum 1” to ensure that a BST is balanced. Fot that, what we need, is to raise a number to the type level and say that “if the height of left subtree is n then the height of the right subtree is either n + 1 or n - 1 and vice versa”.

We can represent numbers simply using Peano numerals. But first we need a couple of language extensions:

{-# LANGUAGE GADTs, DataKinds  #-}

Followed by that define the Peano numerals and capture the invariant in the tree:

data Nat = Zero | Succ Nat

data T n a = NodeR (Tree n a) a (Tree (Succ n) a) -- right subtree has height + 1
           | NodeL (Tree (Succ n) a) a (Tree n a) -- left subtree has height + 1
           | Node (Tree n a) a (Tree n a)     -- both subtrees are of equal height

data Tree n a where
  Branch :: T n a -> Tree (Succ n) a
  Leaf :: Tree Zero a

We can also raise relational operators to the type level using the singletons package by Richard Eisenberg and capture further invariants in the type level. But that will be material for another post. Till then enjoy writing more Haskell :)