Why are they so intelligent? The moment we speak of intelligence, we step into the arena of philosophy. Intelligence presupposes consciousness and awareness, doesn’t it? Or does it? You see, Artificial Intelligence is fraught with philosophical possibilities.
As teachers and writers, we often distinguish among various categories or levels of knowledge, creating hierarchies like Data, Information, Knowledge, and Wisdom, with insights and creativity likely lurking between the last two. I remember using this framework while teaching text analytics, explaining to my students that as one ascends these higher orders, the information density increases. I illustrated this by showing that two numbers—the mean and standard deviation of their scores—could encapsulate the essential performance of the cohort. What I was subtly implying, of course, was that my own place in this hierarchy was closer to the Wisdom end, where highly distilled information is infused with creativity and intellect to yield a neatly packaged product: wisdom.
What large language models (LLMs) such as ChatGPT suggest, however, is quite the opposite. The process of creating intelligence—or wisdom, depending on your preference—is not one of distillation and concentration but of granulation. In fact, the entire hierarchy from Data to Wisdom may be fundamentally flawed, at worst, or misleading, at best. Allow me to explain.
In my earlier statistical example, where the mean and spread summarize the cohort’s performance, I could make the model generative. For instance, I could predict a new student’s score by assigning the mean value in the absence of any other information. Alternatively, I could draw a score randomly from a normal distribution defined by the given mean and standard deviation.
When people say that LLMs are merely “predicting the next word,” they are essentially assuming the latter: that LLMs determine the most probable next word—akin to assigning the mean score to the new student. A more nuanced practitioner might argue that the LLM generates a random word from a statistical model, much like assigning a random score based on the normal distribution. Of course, the word-prediction process is far more complex: The model is “large,” and predictions depend heavily on the context of the conversation.
To build on my toy model, I could create sub-models for specific groups—such as males and females, tall and short students, or individuals of different nationalities and backgrounds—to improve prediction accuracy. In my example, however, segmentation reduces statistical power because the data set becomes too fragmented. For language models, on the other hand, segmentation enhances accuracy. Precisely because they are “large,” LLMs do not suffer from statistical power loss. Instead, their predictions improve. In essence, the more granular the model, the better its performance. But this granularity seems to contradict the traditional Data-Information-Knowledge-Wisdom hierarchy. After all, a fully segmented model is equivalent to the data itself, isn’t it? Does this not suggest that the hierarchy is flawed?
So much for this quasi-philosophical exploration of how LLMs work. Let us now turn to why they appear so intelligent, smart, or wise—or at least, knowledgeable. Ultimately, all these terms may point to the same phenomenon.
This blog is called Unreal Blog for a reason: Reality, as we perceive it, is very unreal. What that weird statement means depends on the context. Here’s one context: If you look at the night sky, whatever you see, the stars, the galaxies etc., are all from the past. More importantly, the way we perceive motion, especially at high speeds, is completely unreal, which is the basis of special relativity. Here’s is a video explaining what I mean by that. Loving created by yours faithfully…
The transcript of the narration follows. You don’t need it because you will see it as the subtitles in the video. But in this age of Google, may be it is advisable to have it.
Here is a question we are not allowed to ask: If something were to travel faster than the speed of light, what would it look like to us? We are asking this question only a thought experiment.
An object, a star perhaps, is moving from left to right at ten times the speed of light, covering a distance of 20 light years in two years. The distance of closest approach to us is ten light years. What would it look like, to us?
The light from the object when on the left takes 14 years to get to us. But a year later, the object is directly above us. And the light from that point in space takes only 10 years to reach us. So the object would appear directly above us before it appears on the left. Or on the right. In other words, what we would see is completely different from what is happening out there. We would see an object appearing, out of nowhere, about 10 [not 11] years after it passes the point of closest approach. Then, it would appear to split and move in opposite directions.
But if any physicists hear of this thought experiment, they would say, “Nothing can travel faster than light. Therefore the question doesn’t make any sense. Even as a thought experiment, it doesn’t get off the ground.”
So let’s rephrase the question. Let’s say there is no physical object, moving in this thought experiment. It is a long, very long, strip of lights. Like, say 20 light years long. We have lights one light second apart. We have programmed them to turn on in a sequence, one every tenth of a second, giving us the illusion that the object is moving at ten times the speed of light. Nothing, no physical object, is moving. There is only one frame of reference. Are we okay now?
Even this scenario may face some objection. Some cautious physicists may say that it is not merely physical objects that are banned from breaking the light barrier. No information can be transmitted faster than the speed of light either. Therefore, the lights cannot be turned on in the sequence as described because the clocks implied in this scenario cannot be synchronized over such distances.
So let’s actually do all the synchronization and programming of the lights all in one point in space, and then move them into position. Yes, moving them, accelerating and decelerating, will destroy the synchronization. Let’s say that we know the velocity profile of each clock. We can pre-compute and pre-correct all the time dilation effects and get the whole experiment set up. Are we good now?
For the ease of description, we are going to that an object is flying by, rather than some lights are being turned on.
Having done all this thought work, let’s run this experiment once again, more carefully this time. As the object is flying by, it emits lights. The light coming toward us, from different points in the path of the object, takes different amounts of time to reach us. The instant in time when the light from a point reaches us is when we see the object at that point in space. The first instant any light from the object reaches us is when it is near the point of closest approach. Let’s call this point the core. So what we see is the object appearing at the core, then splitting into two, and then two objects (let’s call them the phantom objects) moving away from each other, rapidly at first and then slowing down. Our best interpretation would be that there was an explosion at the core and the phantoms are the fragments.
Now the question is, are there such, loosely symmetric objects in the cosmos? There are. They are the radio lobes associated with the so-called Active Galactic Nuclei or AGN. Their common features are a core region, thought to be a host galaxy, and a pair of much larger, roughly symmetric lobes that appear when viewed in the radio frequency ranges.
In our animation, we have drawn the phantom objects with colors turning from blue to red. It is not an accident or aesthetic license. The wavelength of the light reaching us is indeed modified because of timings and the superposition of the waves. Let’s look at it once more, this time with the advancing wavefronts included. This picture, in fact, is identical to the sonic boom in supersonic motion, with the so-called Mach cone. We can see that at the first instant in time when we see the object, it is the surface of expanding cone that passes over us. The frequency at that point is infinity. Right after that, the frequency quickly drops to gamma rays, x-rays, through the visible spectrum and onto microwaves and radio waves. What we are experiencing is indeed a “luminal boom.”
Now, are there beasts like this in our universe? Yes, they are the Gamma Ray Bursts, or GRB. Discovered in the sixties (accidentally, when looking for the gamma ray signatures of enemy nuclear tests), they appear at random points in space. The gamma ray emission lasts only for a short time, and then it quickly changes into X-ray and an optical afterglow. And they do show a correlation with AGNs.
What we have come up with is a model for Gamma Ray Bursts and the radio lobes associated with Active Galactic Nuclei. But with a fatal flaw: the model is based on superluminal motion, which is not allowed. Technically, it violates Lorentz Invariance. But remember, we did not have any real motion in our thought experiment or the animations. It was only a strip of lights being turned on in rapid succession. What creates the phantom objects is just the sequence in which the light from different points in the strip reaches us. And what creates the GRB-like effect and its evolution to AGNs is the squeezing and spreading of the time intervals between successive wavefronts.
There are indeed other models that explain such phenomena as Gamma Ray Bursts (GRB) and Active Galactic Nuclei (AGN), staying well within the bounds of special relativity.
Now we have a big question to answer. When we actually observe a phenomenon (GRB, for instance), are we supposed to peek behind what we see, and ask what is “really” happening? What is the “real” reality? Let’s draw a distinction between our perception and the causes behind them: Perceived Reality is the way we see things, like GRBs etc. The Absolute Reality is what is really going on – it may be a luminal boom, or may be something else. Which one does physics describe? The unequivocal answer from most of my physicist friends is the latter: Clearly it is the Absolute Reality. No question. But is it though?
It is in the perceived reality that we have space that contracts and time that dilates because of the finite speed of light. We can see it in the speed of the phantoms in our animation, which is much slower than that of the original object. We can also see that for the left phantom, the flow of time is reversed: We see later events first, and then the earlier once. Effects first and then the causes. Causality is indeed violated due to faster than light travel, but only in our perception, not in the absolute reality.
When we have space that contracts and time that dilates, we have to ask the question: What are they? What is space? What is time? The master himself had the answer.
Space is a cognitive model, a mental picture, created by our brain based on the electrical signals it gets from our senses. The signals themselves are created by the light falling on our retinas, or on the CCDs or films of our telescopes. Any limitation in that chain (from absolute reality transported by the information carrier, which is light, to our sensing apparatus) will have a manifestation on our cognitive model, which is space. Do we ascribe the effects of such limitations (the finite speed of light, for instance) to the properties of the cognitive model? Do we even have access to anything other than this model?
If it is the perceived reality (the space and time as we perceive) that our theories are describing, then the constraints in the chain of perception (the finite speed of light, for one) manifest themselves as its properties. If a blind bat were to create a theory of relativity based on the motion of bugs it echolocates in space, the speed of sound would become a primary property of its space. The bat would see, for instance, that nothing can fly away from it faster than the speed of sound. Clearly, that’s only in its perceived reality.
And, in a space created and theorized out of the light falling on our retinas or telescopes, is it a surprise that its speed is a fundamental constant? And that nothing can travel faster?
But if it is the absolute reality we are trying to describe, we have a much bigger problem. We have no clue what it is. In our little thought experiment, we could have an infinity of models generating the phantom objects. We could rule out some of the models based on our understanding (like breaching the speed of light, or the current theories about astrophysical phenomena). But at the heart of it, it is a model-based extrapolation from the projection of an unknowable reality into our sensory space. Nothing is as it looks like, in our universe. Our reality is, truly and completely, unreal.
So let’s ask the big question once more: Which reality does physics describe? The cognitive model that is our perceived reality? Or the absolute reality that houses the causes behind our perception?
It all sounds as though I am another one of those crackpots who want to prove Einstein wrong, doesn’t it? Well, I am not, really. But it will take a few more videos to fully explain why.
Let’s wrap up this video with a quote from another master:
“We are only at the beginning of the development of the human race, of the development of the human mind, of intelligent life — we have years and years in the future. It is our responsibility not to give the answer today as to what it is all about, to drive everybody down in that direction and to say: ‘This is a solution to it all,’ because we will be chained then to the limits of our present imagination.”
Richard Feynman
Now, tell me: Would you like to take the blue pill or the red pill?
Life and death has been a recurring theme on my blog. Confronted with our mortality, a common stance we assume is one of anger. Hearing of such a stance recently, I thought I would expand on my notion of gratitude in this writeup, liberally paraphrased from Shelly Kagan’s lectures on this subject.
Gratitude is best described in mystical terms, where we have a generous, benevolent giver (namely God) and a receiver (such as ourselves). A mystic poem that Kagan quoted goes like this (paraphrasing again, of course): God was a bit bored, so he created the universe and all the beauty in it, like the sun and the stars, beaches and mountains, forests and lakes, snow and waterfalls, and so on. At the end of this creation, God wanted an audience. So he looked at some mud on the ground and said, “Sit up and see all this beauty that I have created.” And I sat up and looked. Then I saw. I saw the beauty, not only in love and life and pleasure and happiness and everything nice and great, but also in loss and grief and misery and struggles, in all things bad and mean as well.
I cannot even begin to tell you how grateful I am that I got to be the mud that sat up and saw it all. All this beauty. So much of it that it hurts if we allow ourselves to see it. I got to experience the pleasures and the pride, and the pangs and the anguish. I got a glimpse of God’s own thoughts, written in these immense volumes of beauty. Imagine, if my parents had gotten amorous a minute earlier or later, I wouldn’t have been, and all this beauty would have been lost to me. How can I be anything but grateful for this singular fortune, this supreme gift?
What does it matter that my awareness of all this beauty will cease in 20 or so years? Or tomorrow? I see it now. My experience at this point in time is etched in eternity. It is mine. For now. And for ever.
This little bit about eternity is my dim understanding of an old song, but it is also an oblique commentary on the different outlooks of life. The western outlook is that life is a gift to be appreciated, a container to be filled with as many great things that we can muster in this short blink during which it lasts.
But we, of the East, beg to differ. We view life as a burden or suffering (as in Buddhism), or as a difficult patch in the cycle of life and death. We deal with it by not getting too attached to life and its pleasures.
When I say “we,” I am not sure I include myself in it. Well, may be I do. I see the beauty in detachment as well, in actions performed devoid of any attachment to their fruits or glory, in kindness for its own sake, in a life lived to its fullest, but oriented toward a salvation that is the very antithesis of life. I see beauty in our petty fights and our noble gestures, in our worldly worries and our heavenly pursuits. In everything that adds a little piece to this grand collage, a little square to this magnificent Persian rug, a little shade to this dome of many-colored glass, staining the white radiance of eternity. And I am grateful that I get to see it all.
When I was about 15, oh so long ago now, I had this crazy hobby of book binding, which is like the process of turning a paperback into a hardcover, or adding a hardcover to an exercise book. With the mild OCD that I have, I do get a bit carried away with such things, and no books around me or in my dad’s collection were spared. I collaborated with a local printing press to access their cutting machine and local stationery stores to research on various techniques and acquire supplies. My crowning moment was when I did a “full-calico” binding on a rather useless book that my dad had recently purchased. Continue reading →
My mom used to say that when your child is as big as you, you have to treat them with respect. What she actually said was that you had to address them using a respectful form of “you,” which doesn’t make any sense in English, but may work in Hindi or French. It worked poetically well in Malayalam. I was reminded of this maternal pearl of wisdom recently when I was watching a movie with my son.
I am attending a research retreat on mindfulness and contemplative practices at the beautiful Garrison Institute. I am learning a lot of interesting things, and meeting a lot of like-minded and excellent people – the kind of people with whom I could have deep conversation about the unreal nature of reality, unlike most people from other walks of life would politely and tactfully excuse themselves when I get a bit unreal.
At some point in our life, we come to accept the fact we are closer to death than life. What lies ahead is definitely less significant than what is left behind. These are the twilight years, and I have come to accept them. With darkness descending over the horizons, and the long shadows of misspent years and evaded human conditions slithering all around me, I peer into the void, into an eternity of silence and dreamlessness. It is almost time.
Since the post series on Particles and Interactions became a bit longer than I wanted, I thought I would break it up. Let’s start with a recap of modern physics that will you will need to understand the structure of matter.