Hi Aravind.

Hi Nikhil. This is a bit weird for me, because I'm doing it this way, a conversation after a while.

Okay. Yeah, I wish we could be in the same place. Where are you now?

I'm in San Francisco. I was traveling to Europe last week. A lot more travel coming up, but I hope to be in India pretty soon, hopefully before May.
Well, that's not far. When you come to India, where do you typically go?

My parents live in Chennai, so I first go there, and then it depends on the arrangement, like who I'm meeting. Last time I came, I went to Mumbai and Delhi, and this time I'll probably try to go to Bangalore too, in addition to those two cities. But everything's in flux right now.
Right. Super. Are you like a Chennai boy? Have you grown up there all your life?

Yeah.

So you know, like, the local stuff about Chennai?

I mean, yeah, I grew up there, so I don't know exactly what you mean by local, but I definitely know Chennai pretty well.
Right. How did this begin? Would you like to start by telling us a little bit about your journey, how it was from where you began in Chennai to where you are today?

Yeah.
Well, I was just like any other student in Chennai, just studying. People in Chennai study a lot; that's one thing I've known. I was pretty interested in all sorts of statistical things, mainly coming from following cricket a lot, where people generally try to analyze the stats and run rates and how many 50s or 100s, and I got an intuitive sense for numbers pretty early on. I was pretty good at math, and I also picked up programming early, towards the end of my 11th standard. So that's how it began.

And obviously my mom wanted me to get into the IITs. Every time we would go on a bus and pass by the IIT Madras campus, my mom would point to the campus and say, this is where you're going to study. It wasn't even "you should study here"; it was "this is where you're going to study." So that was the expectation. I definitely grew up thinking that, okay, I do want to compete and win against the best people. The JEE exams are pretty competitive, as you know, and so we had a pretty good rivalry among fellow students. I obviously didn't do as well as I wanted in the JEE, but I got into IIT, into electrical engineering. Inside the campus, a lot of our friends got into competitive programming, so I got into that too, but I figured I was not as good as you needed to be to get to the world finals of something like ICPC. Still, I got a good understanding of computer science, and I got a lucky opportunity to learn about machine learning pretty early on. A roommate of mine, or someone in a neighboring room, told me about a contest that was running on Kaggle, where you just had a data set and you had to predict stuff. I had no clue what any of it meant. It was just a bunch of numbers you downloaded, and you had to figure out a classifier that predicted labels for unseen inputs. That's when I got into things like scikit-learn, which was a very famous machine learning library. I just randomly tried all these algorithms, mixed and matched them, and that helped me win the contest.
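The workflow he describes, trying off-the-shelf scikit-learn algorithms and then mixing and matching them, might look roughly like this sketch. The dataset here is synthetic, since the actual Kaggle data isn't specified, and the choice of classifiers is just illustrative:

```python
# A sketch of the Kaggle workflow described above: try a few off-the-shelf
# scikit-learn classifiers, then "mix and match" them with a simple voting
# ensemble. The data is synthetic; on Kaggle you would load the provided
# CSV instead.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Try each algorithm on its own...
candidates = {
    "logreg": LogisticRegression(max_iter=1000),
    "tree": DecisionTreeClassifier(random_state=0),
    "forest": RandomForestClassifier(random_state=0),
}
for name, clf in candidates.items():
    clf.fit(X_train, y_train)
    print(name, clf.score(X_test, y_test))

# ...then combine them and let them vote on each unseen example.
ensemble = VotingClassifier(list(candidates.items()))
ensemble.fit(X_train, y_train)
print("ensemble", ensemble.score(X_test, y_test))
```

The ensemble votes on each prediction, which is one common way such "mixed and matched" entries were built.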
And then from there onwards I thought, okay, maybe I should take this more seriously. I got an internship at a startup in Bangalore and built recommender systems for them pretty quickly, actually. My internship, which was supposed to be two and a half months, I finished in about three weeks. They submitted my solution to their client, got the money, so they were happy, and I got a lot of time to just sit and learn ML; I would come to the intern office and just learn. So that's what I did: I taught myself all the Andrew Ng lectures, did all the Stanford materials, went back to campus, took the machine learning class there, started doing research, and that got me a PhD at Berkeley. From there I got internships at OpenAI and DeepMind and built my fundamentals. Obviously at every stage I got a better and better peer group and exposure. I always questioned whether my understanding of the world was correct or not. I became comfortable not being the best person in the room. That takes time, because the IIT mindset is, I want to be the smartest person in the room. After coming here I learned it's okay to not be the smartest person; it's okay to be the person that wants to learn everything and learn from the best. That was very formative, because when I actually went for my internship at OpenAI, it was truly humbling. I was very, very bad compared to the people there, and I had thought I was good. Those were the years when I really learned the details of AI and machine learning, and I think that's really helped me get to where I am right now.

What years were these, at OpenAI?
So my summer internship was in 2018. The way it happened is, I came to Berkeley and I didn't have an advisor. When you don't have an advisor, they give you a lab space that's very small, somewhere in the corner, and that was not very stimulating to go and work in every day. But I'm not a person who tries to blame things. So I would just go to the Philz Coffee here every day. I would wake up at 5:30 a.m. and be the first person in Philz Coffee, and I would leave at 8:00 p.m. in the evening, and I would just work every single hour. Because I didn't have my own computer, to do my research I learned how to use the cloud and just worked from my laptop, and all these things were helpful. I wrote a paper pretty quickly, and that got me my adviser, Pieter. One of his students was John Schulman, who was one of the OpenAI co-founders and famously went on to create ChatGPT. He noticed my work, because we were often in the same lab, and he invited me for an internship. That's when I went to OpenAI, and Ilya Sutskever was essentially running the company at the time. There was no Sam Altman, and I think Elon Musk was on his way out too. So Ilya listens to me for about half a minute about my ideas, and he just says, you're wrong, all your ideas are useless. And not in a way that's arrogant; he's just literally telling me the truth in a very respectful way. I'm very upset to hear that, because I actually thought my ideas were cool, and that's what I was being told by other people on campus. So I go on to question him, and he just tells me, AI is just two circles. He draws a big circle, and then inside that he draws a smaller circle, and he says the big circle is generative AI and the smaller circle is reinforcement learning, RL, and together this is the recipe for making AGI happen, and the only thing that remains is to throw a lot of compute at it. He said this in 2018, and I was working on very fancy ideas that made me feel smart but didn't necessarily matter in the long run.
What were these fancy ideas?

I was trying to work on things where the AI would learn its own loss function. The thing in AI is that there's something called the loss function; that's what the neural network optimizes, right? When you're trying to build intelligence, you don't actually know what the real loss function is. You could say intelligence emerges from predicting the next word from the previous words, but someone could ask, why is that a sign of intelligence? You could say intelligence comes from identifying a thousand breeds of cats and dogs, where for all the images you just assign a label and predict the label. But you could say, okay, wait, humans don't learn like that. So there's no one magic sauce for building a generally intelligent model in AI. You can design objective functions that are narrow in nature, like go and master the game of Go or chess, or be the best object detector on the planet, but these are not going to lead to general intelligence. So I was trying to work on research where the AI comes up with its own loss function, trains on it, evaluates itself on a bunch of tasks, and then decides, okay, it has to go and tweak the loss function a little bit to be better at more tasks, and it keeps doing this iteratively. I thought that in that loop, intelligence would emerge. It's a good idea, right?
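The loop he describes can be caricatured in code. This is purely an illustrative toy, not his actual research: the "loss function" is reduced to a single tunable number, "training" and "evaluation" are invented stand-ins, and the outer loop tweaks the loss to score better across a battery of tasks:

```python
import random

# Toy stand-in for "the AI comes up with its own loss function": a loss is
# parameterized (here, one number), a model is "trained" under it, evaluated
# on several tasks, and the loss is tweaked to do better on more tasks.
random.seed(0)

def train_model(loss_weight):
    # Pretend "training": model quality depends on the loss it was trained on.
    return lambda task: 1.0 - abs(task - loss_weight)

def evaluate(model, tasks):
    return sum(model(t) for t in tasks) / len(tasks)

tasks = [0.2, 0.5, 0.8]          # a small battery of evaluation tasks
loss_weight, best = 0.0, -1.0
for step in range(100):          # the iterative outer loop he describes
    candidate = min(1.0, max(0.0, loss_weight + random.uniform(-0.1, 0.1)))
    score = evaluate(train_model(candidate), tasks)
    if score > best:             # keep tweaks that help across more tasks
        loss_weight, best = candidate, score

print(loss_weight, best)  # drifts toward 0.5, the setting that covers all tasks
```

The point of the sketch is only the shape of the loop: propose a loss, train, evaluate broadly, tweak, repeat.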
But Ilya just said this is too complicated. My main takeaway from the OpenAI internship was that even though other people in academia, the elites, will respect you for the more complicated ideas, what matters in reality is making things work. And it's often the simplest ideas that work in practice, especially when you throw a lot of compute at them. The simplest ideas typically outshine the complicated ones.
So when we talk about AI today, let me set some context. Think of me as an absolute idiot who does not understand anything. And whenever you say something, if possible, please try to explain it to me the way you would speak to a 10-year-old boy who's not very smart. That would help.

Sure. Absolutely.

And I think a good place to start: today, where I am, I work in fintech, largely in India. But whenever I read or watch the news, I feel very insecure about the fact that so much is happening in AI, and I almost feel like I'm being left out of it. It doesn't feel like I'm even amidst the action to learn about it. It feels like I'm talking to the commentator, or reading what the commentator has to say, whereas the match is happening in another region altogether. So maybe we can preface this conversation with a brief history of compute leading up to AI, in the manner that you would speak to a 10-year-old boy, and we can take it from there.

Sure. I mean,
AI has been going on for a long time. I think there was a project at MIT which declared you could solve AI in a summer project, literally in three months.

Would you want to first define what AI is?

Sure. Artificial intelligence is a field of computer science that's trying to design computers to behave intelligently.

I was wondering what their definition of intelligence is.

Yeah: program computers to do tasks that require some level of intelligence to accomplish, in a manner similar to how a human does it. And the scope of tasks requiring intelligence that you want the computer to do is where the generality comes in.

So are you saying that intelligence is when a computer is able to behave like a human?

That itself is general intelligence. Right, so take the AI that you write for a chess game. Let's say you're building a chess game as a software project, and when the user picks white, black is played by an AI. The AI you write for that game is not really a generally intelligent AI; it can only do what you hardcoded it to do. It can assign points for pieces, a bishop is worth this much, a knight is worth this much, and it can just run a tree search to optimize for that. By that I mean it can search over moves, roll out a few steps, and then try to pick the one that gives the maximum score. People used to call that AI, but that is not general AI. The reason is that whatever software you write for that cannot play another game, let alone do another task. It is a very constrained, specific setting. Now, that is interesting by itself. There are a lot of things you could do in the world that are useful where you break down a problem and write a specific solution for it. That's pretty useful.
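The hardcoded chess-style AI he describes, assigning points to pieces and searching a few moves ahead, is classic minimax. A stripped-down sketch over a hand-made game tree (the positions and scores are invented):

```python
# Classic minimax as described above: roll out a few moves ahead and pick
# the move whose worst-case outcome scores best. The "game" here is just a
# hand-made tree of positions with hardcoded material scores at the leaves.
def minimax(node, maximizing):
    if isinstance(node, (int, float)):      # leaf: hardcoded evaluation
        return node
    scores = [minimax(child, not maximizing) for child in node]
    return max(scores) if maximizing else min(scores)

# Two available moves; the opponent replies next, so the second level minimizes.
game_tree = [
    [3, 12],   # move A: the opponent will steer toward 3
    [8, 2],    # move B: the opponent will steer toward 2
]
best_move = max(range(len(game_tree)),
                key=lambda i: minimax(game_tree[i], maximizing=False))
print(best_move)  # -> 0 (move A): its guaranteed score (3) beats move B's (2)
```

Note how nothing here generalizes: the evaluation and the tree are specific to this one game, which is exactly his point about narrow AI.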
But what was really on the frontier of science at that time, when I was doing my PhD, was: how can we figure out general intelligence, in a manner similar to a human? That is, one system doing hundreds of thousands of tasks without being explicitly programmed for them, which can be taught new tasks and learn on the fly without much effort.

What is it optimizing towards, though? Let's say an AI or AGI is able to do millions of different tasks and learn along the way. If I were the AGI, how would I decide what to do first?

Yeah. So I think that's where we get even further into what an AGI is. Is it an agent that's constantly deciding what to learn next on its own? Does it have autonomy? Or is it a generally intelligent piece of software that doesn't have any autonomy and doesn't decide what to do next on its own? Is it actually aware of its own limitations, deciding, okay, I lack this ability and this is what I need to go and learn next? No, that's not what we have today. Ideally we should, right? What you're suggesting is an AI that not only learns and trains on stuff that humans throw at it, but also decides what to do next in terms of how to make itself better: recursive self-improvement. That hasn't been cracked yet.

So is the AI trying to make itself better? If an AI can do anything, would it want to make itself better? What would the motive be?

That would be the ultimate motive. If you have an AI that constantly keeps trying to improve itself on any task it wants to, and reasons that this is the thing worth working on next, then I think that would be the ultimate version. Some people even call it superintelligence: an AI that has gone beyond the realms of AGI, which is where today's systems are. Okay, for the sake of our discussion here, let's define AGI as a very, very smart version of current models, say GPT-4.5 or 5, two or three generations later, 6 or 7, where they're doing most of the tasks we do on a computer on their own, pretty well, with just a simple language instruction. A lot of people might want to call that system an AGI, even though it's not completely an AGI: it's not doing the physical work that a human does, and physical work also requires intelligence. But let's just say that's a pretty reasonable working definition. Now, that system still doesn't have awareness of itself. It's not aware of what it's bad at, what it's good at, what it should do next, what its real goals are. It's not autonomous. It's not aware.
So the ultimate problem in AI is: how do you build an agent that does exactly what these models do, but can also keep improving itself and keep coming up with its own goals? And what is its real, true objective function: is it to help humanity? That's what people usually get spooked about when they talk about AI taking over humans. In the AI community right now, people are typically referring to that second part as superintelligence, not general intelligence, where once you crack it, there is no way to control it, because until you shut down the system, it's just going to keep improving itself. But then there's also the argument that, okay, if it's that smart that it knows what to do all the time and is thinking so far ahead of everybody else, why would it not predict that humans might want to shut it down, and create clones of itself and keep staying alive? That's where we're getting into sci-fi territory. But what we're working with today is already some 10,000 knowledge-worker professions in one system, without any hardcoding. That's already pretty crazy, right?

I'm still trying to wrap my head around the definition of intelligence almost being humanlike behavior, if that's what you mean.
I think the definition that is practically pretty useful right now is: can you create a digital remote knowledge worker? That is what most people are working towards. It's kind of converged to that: a digital remote knowledge worker, like an employee you can hire on Upwork. Can it just be an LLM?

But is that intelligence, truly? If you're able to replicate some human abilities onto an agent, would you consider that intelligence? Is that the definition that one goes with?
Well, I think some people are pretty precise about what intelligence is. It's like: until you get me the human-brain equivalent in software, nothing is intelligent; everything is narrow. And there is some merit to that argument. It's pretty difficult to create something exactly like the human brain, because the human brain is amazing. It's power efficient; it doesn't consume all the data centers in the world to do the tasks that we do. It's pretty fast at learning new things. It does physical, intelligent work too, not just digital: dexterity, all that stuff. So yes, you can argue it's not really intelligence in the human way. But the other way to look at it is the functional way: just look at the input and the output. I give the human an input and I give the AI the same input, and does the AI work better than the human on tasks that humans are actually getting paid for in the world? Take software engineering, one of the highest-paid professions today. It's pretty obvious that most human software engineers, at least the median human software engineer, are probably worse than an AI today, right? Typically we've considered people who write code to be smart people; it's just a thing we've done. So now, when an AI is able to do that, if we say that is not intelligence, then it's kind of like also saying that humans writing code was never an intelligent thing either. You've got to apply the same standards. So then what intelligence is kind of changes: it's not the fact that humans wrote code that made them intelligent; it's the fact that they can do writing code and designing art and, you know, building a home, all these things in one person. That is intelligence. Now, one could again argue that there are very few people who are good at doing all three of these simultaneously. Most people are good at only one or two things, or they have hobbies but they're never world class at the hobbies. So that's where we're getting: if humans are also limited in what they can really do in a world-class manner, and one AI system that can write code better than the median software engineer is also writing emails better than the median executive assistant and writing essays better than the median writer, that's a pretty intelligent system. Whatever software system it is, it's definitely not humanlike, but the output it produces is pretty good, or better than most humans right now, and so I would consider it an intelligent system, and a very different kind of intelligence than a calculator or a chess program.
No, I actually like the definition that if a computer is able to mimic what a human can do, it's intelligence. But then, along that spectrum, computers have for a long time been able to replicate or better human tasks, different things like mathematics, for example.

Correct.

Has all of that been under this purview of intelligence?

They used to be done in the research field of AI. When Deep Blue beat Kasparov in chess, it was considered AI research; Monte Carlo tree search was AI research. But then people were not like, oh, this is insane. People were more like, how do we make this work in a more general way, so you're not just doing it for one task? How do we build something?

Does intelligence have to be general? It can be narrow, and if something is intelligent at one thing, you would still define it to be intelligent,
right?

Yeah, sure. I think you can definitely define it to be intelligent.

So what I'm kind of looking for is the distinction between a calculator doing maths and what we call intelligence today, because a calculator does something a human could do, better than he could do it.

I think you can definitely call it intelligence. But then you'd call anything an AI: anything that makes any kind of prediction becomes an AI. The reason people genuinely consider a more general system more intelligent is that it's harder to just overfit a solution to, you know, 10,000 problems at once. It's not like 10,000 programs being written and then stitched together; it's more like one program that's able to do the equivalent of 10,000 programs simultaneously. The same system, the same piece of software, which is just a bunch of weights of a neural network: whatever input you feed to it, you ask it to write code, you ask it to write a poem, you ask it to write an essay, you ask it to summarize a document, the same system does all these tasks, right? I think that is what is amazing. That's where the generality comes from. I could have written 10,000 different programs, one for each of these different tasks, and had a router that tells which program to use for whatever input you give me, and that would still appear intelligent to you, but it's not truly general. When you throw the 10,001st task at it, a slight variation, it might not be able to do it, but a more general system that just uses one piece of code will be able to. And I think that is the power of generality.

Like, in a way you're also saying that intelligence in today's context, with AI having changed drastically over the last whatever time frame, has moved from narrow to more general in nature, where it doesn't just do one task like a calculator, but it can learn how to do another and solve it. Is that correct?

Yeah. Yeah.

Is that the distinction you're drawing?

And I think that is why people are way more excited. And, interestingly, it was able to do stuff that people are getting paid for. So it'll have economic implications. Unlike the previous cases, where it mastered chess or Go and people found it cool but nobody really cared, because they couldn't use it themselves on a daily basis unless you're a professional player, on the other hand, a lot of people are writing code, a lot of people are writing documents, a lot of people are summarizing things. I think that's where, I would say, it's beginning to feel like you hired another human for your work.

Right. And so it's truly replacing human labor in a meaningful way.

Yeah.

This is another stupid question, but I'm going to ask it.
How did a calculator work? I'm using the most basic of examples, how a calculator worked back in the day. When I hit multiply 25 by 25, or 20 by 20, on a computer, what happened on the back end that I couldn't see? How did it produce the output? We'll start there and try to extrapolate all the way to what's happening today.

Sure. I mean, there were circuits for adders and multipliers, and these are the circuits running in the back. Depending on the input you enter, it gets parsed first, then that input gets fed into these circuits, and then you get the output. And you can build even mechanical circuits. That's a beautiful thing: once it works, there are nice visualizations of how adders work in a completely mechanical way. So you don't actually need that much power to make this work. I think I saw somebody say a beautiful thing: a calculator is such an amazing artifact that if you took it, let's say, from 2025 and time traveled back to 1800, it would still work the same way. Say it's solar powered; it would work the exact same way. And that's fantastic, because you cannot say that about, say, your MacBook. You're not going to be able to power it, right?

That's actually useful. The way you described it helps me visualize. When you say it's mechanical, I can imagine an IC doing what I can picture mechanically in my head.
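The adders he mentions are simple enough to spell out. This is the textbook ripple-carry construction from logic gates, not any particular calculator's circuit, and the same gate logic can be realized mechanically or electronically:

```python
# A full adder built from the same logic gates a calculator's circuit
# implements: each stage adds two bits plus a carry, and stages chain
# together ("ripple carry") to add whole numbers.
def full_adder(a, b, carry_in):
    s = a ^ b ^ carry_in                        # XOR gives the sum bit
    carry_out = (a & b) | (carry_in & (a ^ b))  # carry propagates onward
    return s, carry_out

def add_bits(x_bits, y_bits):
    """Ripple-carry addition of two little-endian bit lists of equal length."""
    result, carry = [], 0
    for a, b in zip(x_bits, y_bits):
        s, carry = full_adder(a, b, carry)
        result.append(s)
    result.append(carry)  # final carry becomes the top bit
    return result

# 6 (binary 110) + 3 (binary 011), written least-significant bit first:
print(add_bits([0, 1, 1], [1, 1, 0]))  # [1, 0, 0, 1] -> binary 1001 = 9
```

Multiplication circuits are built by layering these same adders, which is essentially what happens behind the keys when you press 25 × 25.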
Yeah, there are some videos you can go to YouTube and watch, like a three-digit binary counter that's completely mechanical, and it's pretty beautiful.

Okay, and then what changed in computing after that?

Well, a lot of things, obviously. You'd have to go very deep into what people built with calculators, other devices, and so on. But I think the biggest change, I would say, was the personal computer revolution. We had mainframe computers, right? But the biggest change that truly made computing democratized and ubiquitous was people being able to have a personal computer at home. That was the whole, you know, Apple I, Apple II, IBM era.

So sticking to the IC example: Moore's law, ICs got smaller and smaller, so you could have enough compute at home to do the same calculations that before needed a mainframe?

Correct.
Yeah, definitely, Moore's law is one of the critical reasons it happened. But there was also a lot of artistry in the beginning, to package a lot of computation in a very compact way into one board that could be put into a portable computer; that was pretty amazing. A lot of people were actually skeptical in the beginning. They thought it wasn't going to matter: why would people need a computer at home? It's just stuff you do for work. And that's where the beauty was: hey, people might actually want to work at home too. Of course, games were a big deal, but the real reason personal computers took off is the software called VisiCalc, which was essentially a spreadsheet and calculator, and that let people who were doing accounting do the work at home. Slowly it spread; more software started being written for personal computers, and that made personal computing fun. After that came the network effects: if you had a personal computer at home and I had one and we could figure out a way to talk to each other, which is the internet, and then the World Wide Web, and then mobile, cloud, and now AI. It's a very simplistic way to describe it, and there's a lot of detail here, but this is...

No, it's actually very useful, because I feel like whenever I try to learn more about this field, online or with the people I speak to, I get the high-level, non-nuanced, generic stuff that everybody is saying, but I don't have that bridge in my brain that goes from, okay, it started like this, then this happened, then that happened. I interviewed Yann LeCun a few months ago and we spoke for many hours, and I spent a lot of time sitting and trying to learn about JEPA and machine learning and neural networks and what his creations were. But the manner in which I explained it, or we tried to portray it, I think got too muddled, because I don't have a clear understanding of much of this. So let's say when you say we moved from the internet to today's AI, when everybody's talking about AI: if you had to fixate on one thing, why is AI in 2025 different from when people spoke about AI in 2010?
I think the biggest change from 2010 to the 2020s, I would say, not just 2025, is that this thing called neural networks actually works. The forefathers, LeCun or Hinton, Bengio, did a lot of work to establish the foundations, but one guy single-handedly, of course with a group of amazing engineers who worked with him, truly made it work, and I'd say that's Ilya Sutskever. And I think the magic sauce was: throw a lot of data and compute at it. Now you can ask, wait, is that really all? Was it really that simple? And honestly, yes. That's where it came down to blind faith in doing things.

I'm sorry, I'm interrupting you again, but can you explain what a neural network actually is? I have a little bit of history with this, because I work in the stock investor world, and we've had neural networks for a long time. I remember seeing this over much of the last decade, where you would take a bunch of different data factors that we have, like time, price, volume, put all this data into a neural network, and try to get it to predict what will happen next, and maybe start a robo-advisory kind of service, or try to figure out how a computer might be able to predict. But none of this played out in the manner we perceived it when it came to the stock market. It didn't play out then; I'm talking about the last decade. Maybe you can define for me: what is a neural network, in very simple words?

A neural network is a network of artificial neurons connected to each other, layer by layer. And an artificial neuron is just a computational unit that takes input numbers and gives you an output number. It's called a neural network because it's inspired by the biological neural network, which is the human brain, but it's not exactly meant to work the same way either. In fact, that's actually why it works in practice, because a lot of people tried to make it work the exact same way and failed. But think about it as a massive circuit that you're feeding numbers to, and it spits out new numbers.

And it spits out new numbers based on the numbers I have put in and the patterns it recognizes in those?

Correct. Yeah,
exactly. In the stock market example, if
we were to just stick to
it, when you put so much data into a
neural network and it predicts what
might happen tomorrow based on what has
happened yesterday, stock markets often
tend to be random. And there is a school
of thought they call it technical
analysis where people believe that
patterns exist and they try and map out
what patterns happened in the past and
how they will repeat themselves
specifically. But what
if this is a bit selfish cuz I'm I'm
sticking to the stock market example.
But what if the past patterns do not
recur in the future? Then what does the
neural network predict?
That's a good question. So, look, neural networks can be trained to predict anything. Standing alone, without the prediction task, the loss function, the neural network is simply a mathematical function. Very nonlinear. Think of it as an extremely high-order polynomial function. What was the last word you said? Polynomial. An extremely high-order polynomial function. By that, all I mean is very nonlinear: a lot of higher-order interactions and multiplications. Can you help me picture a neural network? You said it was meant to mimic brain chemistry, but it doesn't. Think of it like this: say you're feeding three or four numbers into the input layer. The first layer will take that and transform it; imagine it applies some sinusoids. When you say transform, do you mean transformers, as in Google and their development? No, I don't mean a transformer specifically. I just mean a mathematical function, some function f of those four numbers, where that function is being learned. The way it's implemented in practice is that there's a matrix, which starts as a bunch of random numbers, that multiplies the input you feed in, and then some sinusoid, or some other nonlinear function, takes that and modifies it. Why do you need that? Because that's where you bring in the higher-order dependencies you're learning. Then imagine doing this over four or five different layers, and you end up with a bunch of outputs. It could be four outputs, it could be 40; that depends on the way you constructed the neural net.
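The forward pass just described, a random matrix multiplying the input followed by a nonlinearity, stacked over a few layers, can be sketched in a few lines of plain Python. This is a minimal illustration of the idea only, not any production architecture; tanh stands in for the "sinusoid-like" function he mentions:

```python
import math
import random

random.seed(0)

def init_layer(n_in, n_out):
    # Each layer begins as a matrix of random numbers, exactly as described.
    return [[random.gauss(0, 1) for _ in range(n_out)] for _ in range(n_in)]

def matvec(x, W):
    # Multiply the input vector by the layer's matrix.
    return [sum(xi * wij for xi, wij in zip(x, col)) for col in zip(*W)]

def forward(x, layers):
    # Matrix multiply, then a nonlinearity, repeated layer by layer.
    for W in layers[:-1]:
        x = [math.tanh(v) for v in matvec(x, W)]
    return matvec(x, layers[-1])  # final layer: the raw output numbers

# Four numbers in, stacked hidden layers, four numbers out.
layers = [init_layer(4, 8), init_layer(8, 8), init_layer(8, 4)]
out = forward([1.0, 2.0, 3.0, 4.0], layers)
print(len(out))
```

With the weights fixed, the same input always produces the same output: it really is just a big deterministic circuit of multiplications and squashing functions.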
And then there's a target output that you have, based on the dataset. The current prediction is taken, the target output is taken, the difference is calculated, and you update the parameters of the neural net, which are those matrices at each layer, so that you minimize the loss. And not the loss on one single input: the loss on a giant dataset, millions and millions of examples.
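The loop just described, compare the prediction to the target, compute the difference, and nudge the parameters to shrink the loss over the whole dataset, reduces to a few lines in the simplest case. A toy sketch with a single weight standing in for the matrices (all values here are illustrative):

```python
# Toy dataset of (input, target) pairs: the "pattern" to learn is y = 2x.
data = [(x, 2.0 * x) for x in range(1, 6)]

w = 0.0    # the parameter, starting from an arbitrary value
lr = 0.01  # learning rate

for _ in range(200):
    # Gradient of the mean squared loss over the WHOLE dataset,
    # not over one single input.
    grad = sum(2 * (w * x - y) * x for x, y in data) / len(data)
    w -= lr * grad  # update the parameter to reduce the loss

print(round(w, 3))  # converges close to 2.0
```

Real training does the same thing, except the parameters are millions or billions of matrix entries and the gradient is computed by backpropagation.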
To go back to the stock market example: when we would put data into a neural network and didn't get the output we desired, we would go back and curve-fit the data to get a more desirable output. Does it still rely on the premise of recognizing patterns historically, implicitly? Yeah. If it has to do its job of predicting the output reliably, then it has to recognize whatever patterns it needs to do that, right?
Let's say, to go to the example of just predicting the next word: if a neural network has to be good at predicting the next word given the previous words, then it implicitly has to understand grammar, sentence construction, common sense, all that. Or if a neural network has to predict the next character in a program you're writing, it has to somewhat understand the logic. So whether a neural net captures useful patterns really depends on the task you're training it on. If you're training it on the raw stock price, say you just have a bunch of numbers, Nvidia's opening price every single day, sure, it's not going to be useful on its own, because there are so many other factors that influence the price, and if all it has is each day's opening price, there aren't really that many patterns in that
anyway. So there's this idea in machine learning: the model can only learn whatever actual patterns exist, and everything else in the data is irreducible noise. By that I mean no loss function can hope to capture any of it. You can fit it exactly, but it's not going to generalize. So as long as there's something that's truly signal in your data, and the way you crafted the task captures the signal, that is, doing the task requires you to capture the signal, then yes, the model will definitely be able to capture interesting patterns. And when you said machine learning, I'm sorry, again: can you distinguish what is a neural network and what is machine learning, the
difference? Yeah. So neural networks are one way to do machine learning. And how would you define machine learning? Machine learning, broadly, is training a computer program to do something intelligent, to make intelligent predictions on the datasets you're given, such that you're given a recorded bunch of inputs and you want to be able to make intelligent predictions on new inputs that you've not seen before. And the predictions could be anything. Neural networks happen to be a particular way of doing machine learning, where the predictions are made through this abstraction called a neural net that takes an input, applies matrices and nonlinearities, stacks them repeatedly, makes predictions, and updates them using backpropagation, the way to change the weights depending on your loss. There are many other ways to do machine learning: support vector machines, linear regression, logistic regression, a whole bunch of techniques. But it happens that neural networks are the way to do things when you really want to benefit from scale, when the predictions should keep improving the more data or the more compute you throw at the problem. Neural networks happen to be the most scalable way to do things. If you have only 100 or 200 examples, other algorithms might work just as well. So where does a large language model sit amidst all this? What is it?
So a large language model is essentially a giant neural network trained on this one task of predicting the next word from the previous words, except it's trained on the whole internet. It's trained on terabytes of text, trillions of tokens: books, code, textbooks, general web pages, news articles, all these things. The distinction being it's just text; it's not training on videos and pictures and stuff like that? It can, but since you're calling it a large language model, I'm keeping it to that. Let's take ChatGPT, for example. Yeah. The image part, taking in an image and captioning it and all that, comes only in a different phase of training, called post-training. But most of the compute is thrown at just predicting the next word from the previous words. That's called pre-training.
So essentially, think about the dataset as the whole internet dump: all of Wikipedia, all of Reddit, everything like that. You download it from the web, you tokenize it, that is, you convert every sentence into a bunch of tokens, and you store it somewhere in your S3 dump. Then you feed in, say, 4,000 words, and for each of those 4,000 words you ask it to predict the next word given the previous words. Right.
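The pre-training objective described here, tokenize the text and learn to predict the next word from what came before, can be caricatured with simple counting. A bigram lookup table is the crudest possible version of the idea; the "corpus" below is a made-up stand-in for the internet dump, and real models use learned subword tokenizers, not `split()`:

```python
from collections import Counter, defaultdict

# Miniature stand-in for the training data: a few sentences, not terabytes.
corpus = "the cat sat on the mat . the dog sat on the rug ."

tokens = corpus.split()  # crude tokenization: one token per word

# Count, for every token, which token follows it and how often.
following = defaultdict(Counter)
for prev, nxt in zip(tokens, tokens[1:]):
    following[prev][nxt] += 1

def predict_next(word):
    # Predict the continuation seen most often in training.
    return following[word].most_common(1)[0][0]

print(predict_next("sat"))  # -> "on" in this corpus
```

An LLM replaces the count table with a transformer conditioned on thousands of previous tokens, but the training signal is the same: given the prefix, make the observed next token more likely.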
And this is where the transformer comes in. Correct. Exactly, this is where the transformer comes in, which is a particular neural network architecture that's pretty efficient. You shard the model, the neural network, across thousands of GPUs, train it on trillions of tokens for three or four months, and a pretty amazing artifact emerges: it'll be great at predicting the next word, but it's still not conditioned enough to be practically useful. That's where the post-training process comes in, where you fine-tune this model to be a good chatbot, which is training it to produce good responses to human inputs. That requires a separate data-collection phase, where you're collecting data for practically useful tasks: software programming, composing emails, summarizing documents, uploading PDFs and having it summarize them or answer questions about them. And also generic conversational outputs, where you're training the model to be conversationally good, to keep references to the past, and so on. Once you do that, you end up with a system like ChatGPT. When I was speaking to
you know, explain tokenize to me a dozen
times and all of that, but he seemed to
think that the current path of evolution
of where large language models are going
is not the path to AGI. He had a counter
opinion on it. Can you elaborate on that
a bit?
Well, again, like he he has his opinions
and I think, you know, um he's generally
been right, so it's worth listening to
him.
I would say that what Yann wants is physical common sense; he counts that as a prerequisite for something to be deemed AGI. By that I mean just basic stuff we all take for granted, that we do on a daily basis: how to pour water into a cup. Or say you're a waiter in a restaurant and you have to pick up three glasses and two coffee cups with two hands. How do you do it? You're pretty clever: you tilt them a certain way and make sure they don't break, and so on. Or you have a new bottle of wine: how do you even use an opener you've never used before? You figure all these things out pretty quickly, the tool use that comes up on a daily basis. Knowing it's not good to mix two ingredients that aren't supposed to be mixed. These are the things he thinks a generally intelligent AI should do. The stuff a cat figures out just to get from one place to another, or how a rat behaves in a maze to get where it wants despite all the blocks. These are things that a model like GPT-4 or GPT-5 cannot really do right now. Right. What is the path to that?
I think the example of picking up a glass is great, because for me picking up a glass is this easy, but if I were to train a computer model to do it, it would require so much energy and compute, and it seems like you'd probably have to build an arm and figure out how the fingers move and all of that. So the job of a waiter picking up a glass in a restaurant is likely not going to be taken over by a computer. Not anytime soon. Which is funny, because that's not paid as much as someone gets paid to write code today, right? So it happens in the reverse way: everybody wants to think that what they do is the thing that's going to be taken by AI last. But
let's go back to your question. A lot of things happen in the human brain in a split second; that's pretty amazing. The way computers work right now, they would have to watch YouTube videos of people picking up cups. Then they would have to have a physics-simulation environment where they train a robot, with a suction gripper or maybe a four- or five-finger dextrous hand, to attempt to pick up the cup thousands or tens of thousands of times, and learn success and failure based on whether the cup was actually picked up or not. And then do this in several different gravity environments so that it generalizes to new settings, and in several different visual settings, and it still might fail if a new material is dropped in, a new glass. So I think that's where generalization across different physics settings is still pretty bad. It's not like training on the internet; there's not enough data. So you actually have to build something that's truly intelligent, so that it can learn with very little data. We humans, by the way, you might say we're pretty efficient, but we've had the luxury of evolution: all the basic physical skills we have, walking, running, doing things with our hands, are things we evolved to do over a very long time. AIs definitely need to spend the compute power to train, and so we shouldn't compare the training compute to the inference compute.
And so the best way to solve this problem is reasoning: physical common sense and reasoning is what you need. You parse the scene, and then your planner or reasoner, just like the AI agents you're watching now construct a plan to solve a hard task, has to do the same thing for a physical task. Okay, if I want to pick up three cups together, this is what's likely to happen; if I do this, then that; okay, this looks like the optimal way to do it, and I do it. Would it also be a similar neural network, which would learn from, I don't know, videos? It definitely has to learn from videos, but it also has to build mental models, such that even for scenarios it hasn't watched on a video before, it should be able to reason and do things.
Right. So Arvind, if I were to ask you what changed, like what changed in the last couple of years that this has taken over every conversation? I would say it's a lot of compute thrown at the problem, unprecedented in scale; the key realization that it's not just compute but also high-quality data, and RLHF, learning from human feedback; and actually training it on tasks useful to human labor, like coding and summarization. It all came together simultaneously.
And do you think the one main thing is throwing immense amounts of compute at the problem? Definitely. If there's a highest-order bit, I think it's that. Without worrying whether the outcome is going to make up for the money spent on the compute? Yeah, I think so. Is that the distinction? Definitely, because, by the way, compute alone is useless. People have tried to reproduce these things by doing the same thing, and it doesn't work. You've got to throw high-quality data tokens at the problem too. So there's taste in curating datasets of what will really matter. For example, if you want reasoning to emerge in a model, it's good to make sure you have YouTube transcripts of video lectures, MIT lectures, Stanford lectures, and textbooks where you actually have problems, and not just the problem but the solution explained step by step. When the models learn this, you can prompt-engineer them at inference time, say "think step by step," and then they're able to think step by step and solve a problem. That leads to the next idea, which is chain of thought, where you collect a dataset with chains of thought, where the model is not just solving the problem but actually understanding why something is right or wrong. So even if it's wrong, it can go back, rethink, iterate, and improve itself. So there were three or four key ideas that came one step at a time, and they all stacked on top of each other. But the main realization is this: high-quality datasets with a lot of compute, trained to be conversational with human feedback, made accessible to all people through a simple chatbot interface, made the magic happen. It's four or five good things coming together.
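The inference-time trick mentioned above, appending a "think step by step" instruction so a model trained on worked solutions emits its reasoning before the answer, is literally just prompt construction. A hypothetical sketch (the prompt format and the `build_prompt` helper are invented for illustration; no real model API is called here):

```python
def build_prompt(question, chain_of_thought=False):
    """Assemble a prompt, optionally nudging the model to reason step by step."""
    if chain_of_thought:
        # A model trained on step-by-step solutions tends to continue this
        # opening with intermediate reasoning before the final answer.
        return f"Q: {question}\nA: Let's think step by step."
    return f"Q: {question}\nA:"

print(build_prompt("What is 17 * 24?", chain_of_thought=True))
```

The point of the anecdote is that the capability has to be in the training data first; the prompt merely elicits behavior the model already learned from step-by-step solutions.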
Great, I think I have a better understanding of where we are right now. I'm going to digress for a minute. You said Bangalore. How long were you there for, and what were you doing? That's where I'm from. You mean for my internship? Yeah. You said you finished the problem in three weeks. Yeah, probably. I was in this place called Koramangala, and I didn't actually explore. I just worked all the time, which, now that I look back, I probably think I should have explored. No, now when I look at you, I think you did a good thing by not exploring Koramangala and working all the time. No, I don't mean exploring Koramangala; I just meant exploring Bangalore in general. That I wish I did. But I do remember the traffic being bad, and I'm told it's even worse now, so it's probably good that I stayed in the room and just worked. Otherwise, I do remember the weather was awesome compared to Chennai. The weather was much better.
What did I do... Do you still follow cricket? Yeah, I do. I followed the match on Sunday. I'm actually in Dubai; I came to watch the match. This is my hotel room in Dubai. Oh, cool. It was good. I feel like the stadium had maybe 99.99% Indian support and 0.01% New Zealand; they had a tiny box with 30 people in it. Wow. So everybody walked away happy. Most people walked away happy, yeah. I mean, I was pretty disappointed the last three or four times, when we lost in the semi-final or the final, so I was really hoping India would win this time. That was awesome. But honestly, I want India to win the 2027 World Cup. I think that'll be pretty big. And you were saying, about Bangalore, what do you remember of it?
To be very blunt, I only remember that I worked all the time. Have you been like this all your life? I mean, yeah, I worked pretty hard, and I'm very proud of that. Is there a why, that you work very hard and you're proud of it? I think I enjoy it. I'm not doing it because, oh, you've got to do this and then you'll achieve that, because that's impossible to scale. That's how most people are when they're studying for IIT, or when they're trying to study for good grades at IIT: most people do it because it'll get them a reward. I think some of that applied to me too, but I mostly do it because I enjoy it, and I think that's why I'm able to still keep doing it. Which part do you enjoy? There are so many things you've gotten at this end of the bridge. Which part have you enjoyed the most, and which part do you enjoy most today? I think I enjoy the intellectual part: learning new things, being curious and learning new things. Yeah, you're going to be disappointed in this conversation.
Well, you know, one thing they say is that you might think you know something until someone asks you the most basic questions, and then you're like, okay, let me figure out a way to explain it. Then you're truly testing the limits of your understanding. So I actually enjoy these kinds of conversations. I had a similar chat with Lex Fridman, where he made me go through the whole history of AI and neural nets and search, and how Google makes money, and I'm like, wait, I actually thought I knew all this, but this guy is really testing me and making me question whether I really know this stuff. I enjoy those conversations, actually, because it's pretty rare; it's not like you make a list of the easiest questions, you're actually trying to go deeper and deeper. This is also why we even built this product: it's letting people do that on their own. There's even a saying from Confucius, right? You might feel like a fool if you ask a supposedly simple question; you might feel like a fool for a minute, but you'll be a fool for your lifetime if you don't ask it. So I'm actually always in favor of people asking questions.
I'm going to use that for the rest of today and ask you more stupid questions, but go on. Hopefully I can come out answering them well. But yeah, I genuinely enjoy learning things. I enjoy the challenge of trying to do hard things. In general, my whole life has been about trying to do something that seemed pretty impossible. I'm not necessarily from a rich background, so most of the stuff we did, myself, my parents, to get here: get into IIT, get into Berkeley, get a job at OpenAI, start this company. Can I ask a
question? Yeah. When you spoke just now, you said we got into IIT, and we got here. Do you think of your family, your parents, that way? Yeah. I mean, I did all the work of studying and doing the exams well, but they took care of the other stuff for me, so it's not an individual thing. Same thing now: I'm doing the work of running the company, but my wife takes care of so many things for me at home. And it's not just about the support at home; it's more the moral support. You have very few people to lean on, and there are so many times when you're not necessarily feeling the best about your chances, and there's a lot you cannot share with your own colleagues, because as founder and CEO you always have to appear as if you have it all figured out. So there's somebody you need to go and talk to for help. Or someone to push you, too, right? Sometimes, when things are going well, you might feel like you're on top of the world, and someone has to bring you back to earth and say, hey, calm down, you have nothing figured out yet. And who does that for you? My wife does that for me. Yeah. Wow. And back when I was studying for IIT, my mom was always keeping me in check and making sure I was focused. And it's important: you can't have too many people doing that for you at the same time; the more people do it, the more chaotic it gets. So it's good to have one or two people doing this all the time. What did your
parents do, Arvind? My mom works for the central government, and my dad was a financial accountant. I'm actually the first engineer in the extended family. Really? You're the first? Correct. Yeah, our family had more of an accounting background, so engineering was still a new thing for us at the time. Our audience on this particular show, we speak to wannabe entrepreneurs from India, largely under the age of 25. But for a second, I'm going to put on my investor cap and ask you what the big players are doing. How do you distinguish one from another?
And maybe you can give us a bit of nuance on how one is different from another. Take Grok, take what Meta is doing, talk about what Microsoft is doing. Maybe really low-level stuff that I can understand. The honest answer right now is that all of them are doing similar things. Okay. I'll say it as bluntly as it can be said: there's not really a genuine differentiation between ChatGPT or Anthropic or Gemini or Grok or Meta AI right now. And of course,
for Perplexity you could argue similarly. In the beginning, the differentiation was that we were the only ones to make sure you always had sources for everything: highly accurate sources, fast answers, and so on. But everybody else is also realizing that the real value is in search, even more than free-form chat, and they're trying to put sources on almost every response. So I think right now we're in this weird phase where all AI chatbots seem similar. Some people prefer one over the other, and if you rank response accuracy, I'm sure different benchmarks will rank different ones first or second, but consistently Perplexity is deemed one of the most accurate, fastest chatbots. And I'm very happy about that, because that's the work we've put in over the last two years. But I feel that this year, in 2025 and 2026, the differentiation is going to come from more agentic behavior, where question answering will be seen as a commodity.
Some people will have preferences for some products; some user interfaces are going to be better, responding not just with text but also with charts, images, inline product cards, or hotels. Would you call that agentic? Like if somebody takes a text answer from a certain large language model and converts it into images and makes it richer? No, that's not agentic. I'm just saying that question answering itself, just responding in text, is not going to cut it. Say I ask for the best shoes: you want to actually see the shoes, an actual shoe card, with reviews, compactly summarized for you, with options to buy. Same thing with hotels, same thing with restaurants: you want to book right there. I think these kinds of experiences will differentiate one or two chatbots from the rest, and we are doing our part to be ahead of the curve there. But I feel like the real magic
is going to come from AIs doing things: you can go to the AI and ask it to play a song, play a video, book a restaurant reservation, book an Uber, book a flight, send an email, move your calendar. Say I'm communicating with Nikhil's team: I just ask my assistant, hey, can you ask them if we can start at 8:30 instead of 8? And it does the emailing for me, does the back-and-forth with your team, and just figures it out, while I'm in my bed sleeping and the AI is working for me. I think those kinds of things are missing. Why is it not happening now? What needs to change? It's
only recently begun to take off, because reasoning only recently began to work. Without reasoning you cannot do these things; with just the LLM in the traditional sense, where you get an output for an input, it's very hard. Reasoning based on my data, or reasoning based on generic data with nuance derived from mine? Yeah. So the context comes from you, but the core reasoning skill is in the model. The context, your emails, your existing calendar, we need access to all that, and it needs to be contextual. And I think that's why product building is equally important here, or probably even more than the model, because there's going to be a bunch of great reasoning models, but there aren't going to be a hundred products that really package personal context, all the API and service integrations, native integration into your phone as an assistant, a really good voice experience. Earlier there would be ten details to get right; now there'll be 50 to 100 details to get right. And the more details there are to get right simultaneously, the less chance there are five or six different chatbots doing the same thing. Eventually, if all data is democratized and all the models consume all the data, will everyone throw out the same answer, with only the language being different?
It's already that way right now. If you leave aside the search part and just talk to an AI and ask it questions, most of these models are kind of saying the same thing. And one reason for that is they're all trying to climb the same benchmarks, the same leaderboards. So there's not a very big qualitative difference. You can squint at it and say, okay, I like the response style of this model over the other, but it doesn't matter much. So what is the nuance that makes one subscribe? I'm a user of Perplexity, I have Perplexity Pro, I have ChatGPT, I have a bunch of other models. So what nuance will attract me to Perplexity versus ChatGPT versus Grok versus Meta's AI? What is that
difference? I think it really depends on what you're using AIs for. If you're a person who uses AI for a lot of fact-checking and research and sources, even financial research, you want to get charts, stock prices, balance sheets, all that stuff, then whoever does this best, you would subscribe to, right? And who does this best? I think it's us, but if you don't think so, I would love to know. At the same time, here's the thing: our product has the advantage that we can use any model out there. It's kind of weird; you could ask why ChatGPT can't also have Grok in it. I think it's more that they have a different rivalry going on, about who trains the smartest model, which is what attracts the researchers to work there. How do you
pick which model you use for which answer? We regularly evaluate models on many different types of queries. Do you use one model at a time and then switch to another? It's not that one query goes to a certain model. It's rather that every query goes to a bunch of models, and they're doing different tasks. One model rewrites your query into a format the AIs can more easily understand. Another model does the chunking of the pages into parts that get consumed by the summarization model. The summarization or chat model is different again, and then there's another model that suggests new questions to ask. So these are four or five different models working per query.
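The per-query pipeline he sketches, one model to rewrite the query, one to chunk retrieved pages, one to summarize, one to suggest follow-ups, can be pictured as four cooperating stages. Everything below is a hypothetical stand-in (simple string operations in place of actual models), showing only how the stages compose per query:

```python
def rewrite(query):
    # Stage 1 (a model in the real system): normalize the query.
    return query.strip().lower().rstrip("?")

def chunk(page, size=40):
    # Stage 2: split a retrieved page into parts the summarizer can consume.
    return [page[i:i + size] for i in range(0, len(page), size)]

def summarize(chunks):
    # Stage 3: the chat/summarization model composes the visible answer.
    return f"Answer drawn from {len(chunks)} chunk(s)."

def suggest_followups(query):
    # Stage 4: propose related questions to ask next.
    return [f"More about {rewrite(query)}?"]

def answer(query, page):
    q = rewrite(query)
    return summarize(chunk(page)), suggest_followups(q)

ans, followups = answer("What is tail latency?", "x" * 90)
print(ans, followups)
```

Because the stages are independent, each one can be a different model sized for its job, which is how several models can run per query without each doing the full work.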
Is there a latency play in that? If you're running on top of so many models, will you be relatively slower? No. That's why, even though people say it's a wrapper, there's a lot of backend infrastructure work we've put in to make it so fast. I still think we are the fastest product among all of these, because latency is one of the main metrics we track internally. Tail latency, actually: there's a concept called tail latency, which is the 99th percentile, and that's what matters, not the mean latency. And one thing we
do is throw out part of the answer, and then there are sub-questions you click to go to the next parts, unlike another model that throws everything out at the same time. So the trick most people have figured out in AI chatbots is that you stream the answer, a few chunks of words at a time; that way the user doesn't feel the latency, they just begin reading the answer. It's a clever hack. This happens in voice-to-voice too, by the way: the reason it feels real-time is that the answer is still not done yet, but it has begun talking to you already, and you're just hearing it. So I think
one thing we try to do is serve a lot of the open-source models ourselves, with some fine-tuning, with extreme efficiency. We wrote our own runtimes for Nvidia chips, and we use other chips like Cerebras, and that helps us make the latency as low as possible. And the fact that we have our own index lets us pull the links really fast, with sub-second latency, so the overall latency feels really short even though we do a lot more work on the backend. But I think there's still some more juice to squeeze out here. I feel like by the end of the year there'll be another half a second shaved off, with more improvements on the infrastructure.
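The tail-latency point is easy to see with numbers: a couple of slow requests barely move the mean but dominate the 99th percentile. A minimal illustration with made-up latencies (the `percentile` helper uses a simple nearest-rank rule):

```python
# Why the 99th-percentile ("tail") latency matters more than the mean:
# a couple of slow requests barely move the average but define the worst
# user experiences. All numbers below are made up for illustration.

def percentile(values, p):
    """Approximate nearest-rank percentile (no interpolation)."""
    s = sorted(values)
    rank = int(round(p / 100 * len(s)))          # nearest rank, 1-based
    return s[max(0, min(len(s) - 1, rank - 1))]  # clamp to a valid index

# 98 requests answered in 300 ms, 2 stragglers at 3000 ms
latencies_ms = [300] * 98 + [3000] * 2

mean_ms = sum(latencies_ms) / len(latencies_ms)  # 354.0 ms: looks healthy
p99_ms = percentile(latencies_ms, 99)            # 3000 ms: what tail users see
```

The mean says the system is fine; the p99 says one user in fifty waits ten times longer, which is the metric the interview says gets tracked.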
To be really honest, you know, as a user, I don't even care. If one answer comes in 300 milliseconds and another in 800, and my naked eye can't tell the difference, and you're streaming anyway so I'm reading, I don't know how many people truly care about that difference.
That's the thing: you might feel that way, but let's say we get not 10,000 but 100,000 requests per second in the future. I think these things will matter then, because under that load you'll start to feel it getting slower too, even though right now you don't. Google has done this historically: any time they shaved off even 100 milliseconds on the Google search result page load time, they measured that retention increased. It feels like it doesn't matter, but at the scale of traffic you serve, the more improvements you make the better, because that way you can handle any tail cases pretty well.
Makes sense
actually. I know this is probably a very hard thing to extrapolate, but is there a cost per query to serve versus the fee the user pays? Like, if I pay 20 bucks a month to Perplexity, how many requests do I need to make to really consume that?
I would just say that it's not a static metric anymore, considering that every three months or so there's a new open-source model out there, and that forces the AI labs to lower their prices, because otherwise nobody's going to use their APIs. So the cost per query is something that's constantly going down. That doesn't mean we get to mint more money from you either, because newer experiences are being built with the more expensive models at the same time, like the deep research stuff. It's actually pretty expensive for us to serve deep research, and we still price it at $20 a month. I think OpenAI's version is slightly more detailed on some queries, but it's priced at $200 a month, and you can see that's simply because DeepSeek is open source and we're able to serve it at a 10x cheaper price, and we'll address the inaccuracies in the next few months with better models and more fine-tuning. So that actually makes the margins lower for us on the Pro subscription if people use deep research. But while this happens, the cost per query on regular Pro searches or reasoning searches goes down, because there's more progress on the model side. And agentic tasks, when these AIs start doing stuff for you, will definitely cost us more. So we're actually okay with this uncertainty about what the real margins are on consumer subscriptions in AI in the short term, because the real thing to focus on is getting the experience really good. Sure, we would love not to burn money all the way, but at the same time, hyper-optimizing for margins now would be the wrong tactical move.
If I were to hold you to a specific answer: if you had a hundred bucks and could only put it in one company, and you can take out the private ones, so don't put yourself on the list, but among the listed players, which one would you pick?
In AI, or any company?
AI, or AI-adjacent, because these guys do everything now.
I think I would put it into Meta, mainly because in a world where AI works increasingly well, the human-to-human connection becomes even more essential, and there's literally no one disrupting that in Instagram or WhatsApp. And for advertisements: in a world where people are going to be able to ask AIs to do stuff for them, brand value, how well a brand itself is known to the user, matters even more, because you can ask the agents to just ignore all the sponsored links on Google and truly look for what's best, read the reviews and so on. So what people perceive a brand as matters even more, people-to-people connection matters even more, and people knowing what other people say matters more. I feel they're very well positioned to keep their existing ad business strong, or even make it stronger, in a world where AIs actually work. It's kind of an interesting position for them, where their ads business is going to flourish even more when AIs work.
I wouldn't say the same for Google. I think Google ads and Google agents are on completely opposite ends of the business incentives, and that's also why Google has the least incentive to bring out AI-native search or agents right on the core Google homepage or in the core Google apps. It can be hidden in a mode, or fire for some queries sometimes, but it's never going to be the central piece. Meta doesn't have this problem at all. They can roll out AIs, but the core feed, watching what other people post, is still going to be the same.
I was thinking about this the other day, with all the talk about tariffs, who imports how much, who exports how much, and the deficit the US is running versus India or China or many of these other countries. Almost every new company here in India spends all of its marketing money, its distribution money, online. If I were to start even a t-shirt company, a coffee brand, I don't know, a SaaS company: gone are the days of putting an ad in a newspaper, on a TV channel, or on a cricket team, the traditional ways of spending money to get distribution. Don't hold me to the numbers, but pragmatically I see that declining, and more and more of the money spent discovering clients in India goes to Meta or Google. That revenue might be registered in Meta India or Google India, but essentially, the way I look at it, it's trickling back to the parent company, because the market caps of these companies go up by virtue of the revenue they register here in India. Since we are at the very core of it, talking to entrepreneurs who want to start something in India: do you think there's a play there to disrupt this market? Do you think it's even remotely possible for someone in the Indian youth to build something that takes away some of this pie?
Well, if an Indian company started an Instagram or WhatsApp rival, I would be very impressed by the bravery of it. Not that I'm dismissing it as stupidity.
No.
Well, I would say what I'm doing is similar: even now, people think it's a stupid idea to compete with Google. But I think there's an angle that can work. Okay, here's how I would see it. Suppose you can build way better ad targeting than Instagram does, at least for the consumers in India that a business is trying to target.
And sure, all the advertisers have to do is this: if they're spending a million dollars a year on ads, instead of spending the entire million on Instagram, they spend 700K on Instagram and 300K on your thing. That's already a big disruption.
That's step two, after I first garner the distribution for my platform, where people come.
You need to have one core reason why people even post on your platform.
Yeah.
And when you have zero users, or you're just getting users, the creators, the ones posting stuff, want traction. They want likes, they want shares. And so that's the problem.
That's a cold-start problem.
And network effects. That's kind of why I said Meta has a bigger moat than Google. Google's moat on distribution comes from their deals with carriers, OEMs, and all these people, but Meta's moat is just raw network effects. Nobody pre-installs Instagram or WhatsApp on phones; Android pre-installs Google, yet despite that everybody goes and installs these apps. So you've got to change that in some way and build a user base from scratch, and I think that's the hard part.
Any ideas? Say I want to try it. How do I do it? Give me an angle.
To be honest, I haven't thought hard about it, but let me try to think on my feet here. The last big app that actually did it was TikTok, right?
Yeah.
And interestingly, they actually grew a lot through Instagram: they spent billions of dollars of ad spend on Instagram to grow TikTok. What I was told is that the Meta team was laughing at it, like, hey, we grew to all these users organically, and little do they realize that retention on users acquired through paid channels is pretty low; they're just making us rich and growing our stock price by increasing our ad revenue, and they're not going to actually retain any of those users, so it doesn't matter. But that ended up being wrong. They actually got a lot of users, and the only reason Instagram is still fine is that TikTok is banned in many countries, particularly in India, right? So
I would say, definitely, you've got to spend a lot per user on existing channels. And definitely you've got to have some new unit of information that existing platforms don't have, something that becomes core to your platform. The Reels-style short video was new; obviously Instagram copied it, but at least for a while it was new on TikTok.
Is that where Perplexity will go eventually? Do you think you'll need to have ads?
I hope not.
I think the market for an assistant that is so personalized to you and does a lot of work for you, gives you daily briefs and updates, does market research for you without you even asking, is massive. People would pay hundreds of dollars a month for such an assistant, because it's kind of like hiring a person. And if the ARPU is that high, say $1,000 a year, and we can get at least 10 million people to pay for it, I think that's a pretty successful multi-hundred-billion-dollar company of its own. And if you can do that, and figure out a way to grow x% a year, getting to the order of magnitude of Google's ad revenue is certainly more achievable.
I also think that once an assistant is truly personalized, ads are pretty easily doable. The reason Instagram ads are better than Google ads is that they're very personalized to you, right? Instagram has done research showing that when they removed the ads on the platform, engagement time went down. Google has never really publicly done a study like that, and I'm actually certain that removing ads would make the Google experience better today.
But can they survive without ads? They cannot. That's the thing.
Yeah. That's where I've kind of stopped using Google for so many things. What I find today is that I'll search for something on Google, get annoyed by all the ads, and then immediately open up you guys, or OpenAI, and search there, because on Google I feel like I'm searching and searching.
That's the thing: you are searching first on Google, right? Why? Because they are the search bar.
Yeah. And that's why you've got to build a browser, or you can convince people to change their default search engine to Perplexity. But you know what they do? They always put up these popups that say, hey, turn your default back to Google, and there'll be two options, and the highlighted blue one will be yes; the option to keep your setting will be the non-highlighted, non-bold text. They have all these tricks they've learned over the years to preserve their dominance, so that the first query goes to them.
Another thing I'd tell you about why they'll continue to be dominant for a few years: let's say you do your research on what microphones to buy, or the best headphones for podcast recording. You're buying equipment for your podcast studio, and you do your research on Perplexity or ChatGPT, so now you know what to buy. Then you go and actually make the purchase on Google, or you go to Amazon; most people just type the brand into Google, click on the Amazon link, and buy there. So who actually makes money out of the research you did?
Google, not us, right? Because Google makes money every time you click on a link and make a purchase: they get to claim the cost-per-click conversion to the advertiser, and the advertiser says, oh wait, I'm getting all these conversions because of Google, so I'm going to keep spending my advertising budget there. So this is the problem. You've got to have AIs that not only help you do research, but also help you make transactions natively.
And you've got to have AIs that are not vulnerable to the search bar placement. That's the real challenge that companies like us or ChatGPT have to address. It's not that we can't provide a better product; I think that's pretty obviously clear. You've got to be able to finish the other two or three steps that remain to get rid of the Google dominance, and Android is a massive advantage for them. They don't let anyone sell a phone if it doesn't keep Google as the default search engine. What they do is say you cannot have the Play Store, and if you don't have the Play Store there are no apps, because nobody is building apps for any other app store, so no phone maker can sell such a phone. And they don't share ad revenue on Google search if you don't have Google Assistant as the default. So there are a lot of things.
Sounds like a relatively easier business to disrupt, the Play Store or the App Store.
First of all, that's where most of the people building apps are. You can have a fork of Android and ask people to publish to your app store too, but they don't get the visibility they get on Google's Android, because the phone makers are still going to use Google's
Android. And the phone makers are using Google's Android because Google shares with them the ad revenue from Google search that gets made on mobile phones. You cannot do that until you have that scale. So it's all very tied together, and the more you understand the details, the more it's like playing chess: figuring out the next move you can make as a startup with lower resources that still convinces the telcos and OEMs to work with you. I think that's the hard challenge.
And they're always there to spoil your plans. But again, to the question you asked about whether there can be an Indian rival to Instagram: you're not just going to be focused on building a better product; you'll have to spend a lot of your energy thinking about distribution, and Meta can come in any time and say, hey, just pre-install Instagram on all your phones and I'll share ad revenue with you for Indian users.
And even before that, I need an angle for how I get the initial user base.
Exactly. So it's very difficult, but I certainly think it's worth attempting. Some brave person needs to do it, ideally someone who already has their own audience, so they can actually use their influence. Silicon Valley people have tried it, like Clubhouse. Again, everything has lessons you can learn from.
So the question is: if text moved to pictures, which moved to short-form video, could the next thing be long-form video?
No, YouTube is there, right?
Yeah. By the way, YouTube is actually one of the biggest rivals to Instagram, I would say, mainly because that was Instagram's market to take, right? Reels.
Yeah. And the long-form video?
Long-form video as well, yeah, a little bit.
What I was told is that YouTube's ad revenue now comes more from TVs than from the actual YouTube app, so you can see that people are beginning to spend more time on TVs now than on the mobile app. There's probably something there. Podcasts are growing a lot; of course, you're making one of the most listened-to podcasts. That's a thing Instagram is not really getting; it's going more toward Apple Podcasts and Spotify. So there are always people figuring out new forms of content that don't necessarily go to Instagram.
So that's an opportunity if I'm able to aggregate every Indian podcaster, and improve the quality of their video, or, I don't know, include a chat function where listeners can talk to the podcaster and the guest, some angle like that. Do you think, if I were able to aggregate that, it's a possibility?
Definitely. Another thing people haven't really tried is live-streaming the podcast. Let's say we're talking now: the way podcasts work is we record it, you edit it, we post it, and then people listen to it, but there's no communication between us and them, right? X tried that with live streams, and Instagram has it too, Instagram Live, but it's not really podcasts.
Right.
Yeah. But there's something where you can consume all the podcasts and also talk to the people who made them, and they'd respond to you in the comments. You could probably say, hey, I want to hear this Nikhil episode, but only the parts where he talks about AI, and it'll edit it really fast and make a new version, because you just want to listen to that. That's something YouTube doesn't do well.
What do you call it when you convert video to text? There's a word for it... the transcriptions. Yeah, they don't do great at transcriptions, right?
Yeah, but again, why do you even need to see the transcript? Transcripts are there because they're a hack to get to what you want. If you could literally just enter a prompt and say, make a version of this podcast that keeps only the parts where Aravind and Nikhil talk about AI, or neural networks, it would just create that segment and you would just listen to that.
Could that happen now? Because I thought most large language models are text; they're not consuming video yet.
Right, exactly. So you don't need the video part as much. You just need to make sure the transcript is pretty accurate.
Or you even take the MP3 audio file, and the long context is good enough to consume all of it. Then you just say, I want only these parts, and it'll tell you the timestamps, and you take those and make a video out of it. It's going to have rough edges; I'm sure it won't work perfectly. But with some engineering you can make something like this happen.
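The timestamp-based editing idea can be sketched roughly like this. In a real system a long-context model would read the transcript and return the relevant timestamps; here a plain keyword match stands in for that model call, and the transcript segments are invented sample data.

```python
# Rough sketch of "give me only the parts about AI": a long-context model
# would pick the segments in practice; a keyword match stands in for that
# call here, and the transcript below is made-up sample data.

def select_segments(transcript, topic_terms):
    """Return (start_sec, end_sec) spans of segments that mention the topic."""
    terms = [t.lower() for t in topic_terms]
    return [(start, end) for start, end, text in transcript
            if any(t in text.lower() for t in terms)]

# (start_sec, end_sec, text) triples, as a speech-to-text step might emit
transcript = [
    (0.0, 12.5, "Welcome to the show."),
    (12.5, 40.0, "Let's talk about neural networks."),
    (40.0, 65.0, "Now, some cricket stats."),
    (65.0, 90.0, "Back to AI: how models reason."),
]

clips = select_segments(transcript, ["AI", "neural"])
# clips -> [(12.5, 40.0), (65.0, 90.0)]; these spans would then be cut from
# the audio or video with a tool such as ffmpeg.
```

The model-driven version would replace the keyword match with one prompt over the whole transcript, which is exactly the "long context is good enough" point above.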
The hard part, honestly, Nikhil, is that you've got to start from scratch. You've got to create incentives for people to consume stuff, so some new element is needed, and then a lot of sharing on existing platforms of how this is the next big thing. But you're right: if there's a way to aggregate all the podcasts happening in India on one platform, with people able to edit them and listen in any new language they want, there might be a big product-market fit there.
Very interesting. I'm going
to ask you another personal question. I have a private equity fund, and we're reviewing a data center business, fairly large, something that does maybe a hundred million of EBITDA right now. Data centers have become such a thing in India, Aravind. Every real estate person that you or I speak to today is talking about data centers. In the 2025 version of real estate, the big thing for them is not a new building; it's building a data center. If you were able to buy data center businesses at a 20 or 25 multiple of EBITDA, would you do it today? Or is there something I'm missing, something changing in terms of how data is being compressed, quantum computing, or compute moving out of the data center, that means one should not do it?
I wouldn't
that one should not do it. I wouldn't
really worry about quantum computing
right now. Uh I I think it's still in
pretty early
stages. Um I certainly think India
should have its own data centers like
like there's no um reason not to. Um and
um definitely calls for good real estate
expertise. Um infrastructure uh buildout
is not easy. uh buying the chips
uh connecting them
the making sure you use the right
technology for the interconnects between
these different
GPUs building these server racks. I mean
uh compute centers in in in different IT
have done this like like you know we had
a clust compute cluster that we had
access to in IIT and it would live in
the computer center. So definitely
doable.
And it really depends. Okay, so there's this company called CoreWeave in the US; I think it's going to IPO pretty soon. It's the first pure data center play I've seen: it's not a big tech data center, and Nvidia owns a big chunk of the company. I think the way they compete against the rest is that they do the buildouts faster, and OpenAI is using them, along with a bunch of others. So if you can provide training GPUs to people in India much faster, and at potentially cheaper prices, because the data center buildout costs might be lower since labor costs are lower, there's probably something there. And I hope that at least for inference it makes a lot of sense, because data sovereignty might become a thing. Let's say, even for companies like us in the future, the government of India wants the data of people using Perplexity in India to stay in India. Then it makes sense for even American companies, or other companies outside India, to be using data centers built out in India, so that the data is stored in
India.
I think it'll happen eventually, invariably. Right now the financial data sits outside India, and India creates something like, I don't know, 20% of all the data, because of the number of people with smartphones. So the assumption is that it will happen, and hence everybody is talking about the data center business. But structurally, there is nothing that is changing in the data center business itself.
I don't expect it to be a very high-margin business on its own unless you combine it with good software.
And what would software look like for a data center?
Spin up jobs easily, host models, have Kubernetes support for scaling instances; that's kind of what the cloud companies have shown, right? Maybe in the short term, if you're the only one who can provide a data center in India, you're going to enjoy good margins, but in the long run you should expect more people playing the game.
Yeah. There are many providers already; there's maybe a gigawatt worth of data centers. I'm not sure of the exact number, but it has scaled significantly. The question is whether it continues to grow in this manner when, at the end of the day, it's a very commoditized business. It's almost like a real estate company starting a warehouse; I'm not able to distinguish whether one has IP over another.
Not until you have some vertical integration done pretty well.
And the other big worry is: does this become such a big business that the hyperscalers build their own and do not go to a third-party vendor?
Possible. I mean,
hyperscalers actually build their own data centers everywhere, except where there are real constraints: where they have to move super fast and the only way to do that is to work with someone else locally, or where there are local regulations and restrictions on what outside companies can do in physical spaces, and someone who's already there, an Indian business, can be the only one who can do it on the timelines they want.
Do you have a view on Nvidia, the margins they operate at, and the scale of revenues and profitability they're at? Why has there been no disruption?
I think it's pretty hard to do what they do at the margins they have; that's the main reason. They have a very flexible chip: it can do a lot of computations. It's not just about inference, not just about training, not just about dense models or mixture-of-experts sparse models. It's not specialized, it's very general, so you can do a lot of things with one chip. And they have perfected the art of the interconnects, of the data center buildouts.
I think software is a big advantage for them too: CUDA has such a big moat, and developers are all trained to use CUDA already. It's very hard to go learn a new software stack, and they keep a lot of the CUDA stuff closed source, so it's pretty hard to replicate. And by the time you do all the work of building your own software stack and your own hardware and making it similarly general, they have the next generation of chips, and they already have the relationships with all the hyperscalers to get their chips in as first priority. So they're competing on many levels; it's pretty difficult. But recently, at least on the inference layer, some alternatives have appeared, like Cerebras, and Groq with the q. Then again, they're enjoying the time period before Blackwell comes to market, and Blackwell chips are supposed to be way more efficient than H100s for inference, so maybe all of this will be short-lived; we never know. Certainly the margins might get affected, but raw AI usage and adoption, and how many others are going to build AIs, are also going to grow, so the company might still be a very lucrative business to invest in. It's one of the least understood stocks, I would say, even though there's a lot of energy and effort being put into understanding it, and it's pretty sensitive to AI progress: AI progress needs to keep happening at the same pace for Nvidia to keep going up again and again.
I mean, the earnings also seem to have caught up to a certain degree, right? They're at something like 40 times one-year-forward earnings, which is not ridiculous like it once was. I tried to learn about Nvidia; correct me if I'm wrong, but was the big distinction between Nvidia chips and the incumbents of five or ten years ago the fact that the incumbents did tasks sequentially while Nvidia does many tasks at the same time? Is that the main difference?
That's one way to put it. Mainly, Nvidia specialized in graphics, and graphics is a lot of matrix multiplications; that's how the math works. Matrix multiplication is parallel computation. And, interestingly, by a very interesting coincidence, it wasn't designed to be this way: neural networks are also a lot of matrix multiplications.
So because they specialized in making matrix multiplications fast for graphics, the core set of primitives they built ended up being a great fit for AI, for neural nets. If AI had not been neural nets, then GPUs wouldn't have mattered; but AI happened to be basically neural nets at scale. So all the primitives they built, all the software stack they built, ended up being the core foundational building blocks for neural networks too.
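The shared primitive is easy to see in code: a graphics transform and a neural-network layer are both just matrix multiplications, and every output cell is an independent sum, which is exactly what a GPU computes in parallel. A pure-Python illustration:

```python
# A graphics transform and a neural-network layer use the same primitive:
# matrix multiplication. Each output cell below is an independent sum,
# which is why GPUs can compute them all in parallel.

def matmul(A, B):
    """Multiply an m x n matrix A by an n x p matrix B (lists of rows)."""
    n, p = len(B), len(B[0])
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(p)]
            for i in range(len(A))]

# Graphics: rotate the 2D point (1, 0) by 90 degrees.
rot90 = [[0, -1],
         [1,  0]]
rotated = matmul(rot90, [[1], [0]])            # -> [[0], [1]], i.e. (0, 1)

# Neural nets: a dense layer is weights @ inputs (bias/activation omitted).
weights = [[0.5, -1.0],
           [2.0,  0.0]]
activations = matmul(weights, [[3.0], [1.0]])  # -> [[0.5], [6.0]]
```

Same `matmul`, two domains: that coincidence is the point being made about why graphics hardware fit neural networks so well.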
All the neural network training libraries were built around it, and now it's very hard for someone new to come and change that. The only one who's managed to do it, I would say, is Google. They built their own chips, they built their own software around them called JAX, and they built their own accelerated linear algebra library called XLA. And they have their own data centers too. So they're the only ones who've managed to do everything full stack, completely independent of Nvidia's libraries and Nvidia's chips. Everybody else had one or two of the pieces, but not all of them.
Right.
Also, what is India's role in all this? Like I said earlier, this is genuinely how I feel. You know, Gen Z uses a particular phrase all the time, FOMO, fear of missing out. I face that on a daily basis, because I keep reading about AI, but it feels to me like the match is happening in another geography and I'm talking to the commentator's friend about what is happening, or reading what he's saying on Twitter, or X. What can India do, or what should it do? And you can be honest about this, because it's something we want to incorporate, and we want young people to go out and at least try.
I've said this before: I think India should definitely train its own models.
But wouldn't we arrive at the same answers that the incumbent models are arriving at, if the data is largely democratized and our data is also part of the training pool?
It doesn't matter. I think we should still build our own models, because there's so much more work to do on models: making them reason and think, making them good at things they're not good at today, making them more agentic so they do tasks, and so on. India should have its own DeepSeek-like company that trains models and competes not just on Indian languages but on global benchmarks, and that will inspire the next generation of engineers to come work in those companies and build out the future beyond the fundamental models.
I'm guessing this requires serious hardware and a reasonable amount of...
Yeah: data centers, chips, models.
What does somebody young do? Say a 25-year-old boy or girl sitting in Bangalore or Chennai or Mumbai or Delhi. What do they do, specifically, today, with no resources?
I would hope they can raise some venture funding and try to do something.
Let's assume they're able to raise a million dollars, because AI is hot right now. Then?
well it's pretty hard to do something
meaningful with a million dollars but
certainly doable um the way I would do
it is I would build a product that's
pretty interesting and new uh get users
raise more money um get more users and
raise a little more money and then
start to build your own models uh start
with post-training on top of open
source models then start to like look
into pre-training too and um then get
into the data centers. Like it's a
multi-stage process. That's what I would
do if I could start small. But if you're
already established, if you're not like
this 25-year-old young person, if you're
already somewhat established, you have a
presence, you have a name in the field
or or or able to attract investments of
higher magnitude, then I think you can
go for the more ambitious targets right
away.
Is there any like nuanced low-hanging
fruit that Indians who want to start off
are not taking
advantage of?
I don't know, maybe language, maybe we
have access to. I think voice: uh most of
the AIs are pretty bad at Indian voices,
uh the speech recognition and speech
synthesis are not necessarily good. Mhm.
That's a place where you can make a
clear difference because it's not a high
priority for the western labs to make it
work, and there are so many dialects
and languages and like I think Indians
are also more
um mobile app users and so voice is a
more natural form factor of interaction.
Mhm. So really having that amazing real
time AI voice synthesis, but
uh broadly like support for all the
Indian languages, nailing the dialects
and accents and grammar, would be a big
deal. Um it's easier said than
done. It's not as easy as just
collecting data. You have to do a lot of
evals and training and like iterations.
Mhm. But it's definitely something that
will matter a lot for the Indian market
more than anybody else.
Because you're an investor as well, would
you buy Nvidia stock today? I have
exposure to it. Mhm. So, uh I'm not
selling, I'm holding. And I think I
believe
in like basically everybody's going to
try to build super intelligence and
general intelligence. And uh Mhm. I
think even if RL is working, I think you
need a lot of compute to do it. And so
is... Is SF petty? Like if you said
something bad about Nvidia, would you
get lesser chips when you needed them?
I've not done that, so I don't know. Um
but I I think not. I hope not. Yeah.
Right. What about the Indian outsourcing
giants? Think of Infosys, TCS, Wipro.
What happens there? I think they're
just going to use AIs, and what happens
to all the people who are
there? If AIs are able to replicate the work,
they're not going to hire as many people
going forward. But the use case for an
American company outsourcing to the
Indian company to begin with was, if one
were to assume, cheaper cost of labor.
Yeah. And now maybe an agent does that.
Then what happens to these companies
on the whole?
Um certainly, they'll have to
charge
less. Um some of it is actually based on
like relationships. So like, I know these
AIs can do some of these things, but I
would still trust you guys to do it
without any bugs or
errors. And
uh, you know, like until AIs are at a point
of reliability where you
just have no arguments not to use them.
I feel like humans will still trust
other human businesses to do stuff for
them, but they'll just push them to
like, hey, like now that AI can use
this, why do why do you guys need like
three months to get it done? Get it done
faster. Like, why do you guys need to
charge us this much? Um, charge us
less. I think they're going to push
more on those shorter term trends rather
than saying, "Hey, I don't think
we need you guys anymore." It's
interesting that you say "you guys."
Now, at this point in life, do you view
yourself as an American or Indian? No, I
don't mean like "you guys" as in the
Infosys people. I don't mean it in a bad way. I
don't mean... No, no. Let me be
clear. I don't mean we live there. Yeah.
Yeah. So for the previous statement,
I want to say, it was simply between like
what would the
uh software
buyer say to the software provider. I
mean "you guys" in that sort of way. It
could mean by the way there are
companies like Infosys smaller scale
that do it in America too where like
literally say somebody wants to move
from Databricks to Snowflake and AIs
cannot do the code translation, a human
firm is actually doing that for them,
right but as somebody who has never like
I want to ask you like a favor at the
end of all this but as somebody who has
never lived in the
west, if you live there for long enough,
does it become...
Like, would one be conflicted in what
you associate with?
I mean you certainly change as a person
like you're not the same person anymore.
Obviously you're having a different uh
outlook towards life and the world in
general. But you are like uh rooting
obviously for India to succeed, and I
don't see a zero-sum game between India
and America, actually.
um American businesses benefit a lot
from Indian users
Indian businesses benefit a lot from
American
technology and
um so there's certainly a lot of
positive-sum games to be played here.
And uh so it's
actually like one of those rare
uh
combinations that end up behaving this
way. Uh not every country in the world
is like you know super friendly or like
non-competitive with America. Yeah. And
India is like pretty lucky to be in this
position
with AI changing so many sectors, and you
know, it's kind of like replotting the
map. Like our crowd is fully
entrepreneurship oriented, right? Like a
want-to-be-entrepreneur crowd, all of us.
Is there a sector that has tailwinds
amidst all this? I'm thinking, think of
anything. I could start a restaurant, I
could start a steel company, I could
start a SaaS business, I could start a
t-shirt brand. I could start anything.
Like, yeah. Is there a sector with
tailwinds
where I will be served well in
attempting entrepreneurship in the next
decade?
I think there's going to be a lot of
personalized apps built. Mhm. Like can
you elaborate? Right.
Uh right now if you want an app to work
for you, what you do is you go and file
like customer support bugs, or like you
complain, you tag them on Twitter and
say, "Hey, this is not good. I want
this. I want that." And what the
app developer usually does is like they
have their own road map and they see the
customer feedback. They look at the
dominant feedback and then they try to
prioritize that into their road map. But
that feels inefficient. And in a world
where AI can just write any any
software, I can build my own software.
Like I can have my own kind of fitness
app that'll work for my needs, you know?
It'll know what doesn't work for me,
what I don't like doing, what kind
of workouts I like, how I feel that
day. And like I can program it to work
for me. Same thing with health. Same
thing with like tutoring. like I I can
have my own personal tutor for any
topic. Maybe I don't know anything about
finance and maybe I want to get up to
speed, and like I can tell it precisely.
And, you know, I could try that with
Perplexity or ChatGPT too, but
then what if it doesn't actually tell it
in the way I want? And I want to be
able to build my own app for me. I
think that layer has still not taken off,
but it's certainly something that's
waiting to happen because as you can
clearly see software creation is getting
a lot easier. So someone's going to be
able to be that platform for deploying
all these things in a secure way, and
then people sharing apps that
they have built for themselves with
others, and like there's some social layer
around it too. And um I don't think
anyone's really cracked this, and this
might be a
huge, huge market by itself. And I
don't know how monetization is going to
exactly work here. Or is it like micro
payments where if I use the app, you
create it, I pay you. I don't know. Um
or is it going to be more traditional
like ads where different people are
advertising to each other? It's it's not
clear. But what is clear is people are
going to create a lot of personal stuff
for them or their group of friends. Like,
imagine I just
wanted an app to like split my payments
with friends, right? Earlier you would
go and use Splitwise, but what happened
before Splitwise? You would do it all
manually. Now I can just create a
Splitwise app that's more custom, and I
don't have to be like, oh, Splitwise
doesn't have this feature. What if I can
just directly, like, you know, build a
better version of Venmo or
something like that, right? I
think these are the kind of things that
I feel even within businesses and
enterprises like if I want to track the
vacations people are taking, someone
else would need to have built a vacation
tracker SaaS app. And, you know, that's not
needed; I can just build it myself. Uh so
all this stuff is going to change a lot,
and we're not quite
there yet at this moment, because
there's still bugs, there's still things
to fix. How do you deploy the app? Okay,
Claude can write the code for you, but
you have to actually deploy it. You have
to actually be in charge of the where
the data is living, all that stuff, but
someone's going to abstract all these
details out for you. It's going to feel
super seamless, and I think, in
my opinion, this is the
thing that will take off very quickly,
but it's not quite there yet. Can you
name a couple of apps that, as somebody
who doesn't understand technology too
much, such as me, could use to
get better at business, become
more efficient as a
person? I mean, I would love to say
Perplexity, but I use Perplexity already.
Okay.
Um I think you should definitely give
a shot at um Cursor. It's like a coding...
What does Cursor do? Cursor is a coding
assistant. Like, it helps you
write
code with an AI. Even if I know nothing
about writing code, right? You can just
go and ask it to say, hey, I want to
build a website with so and so generate
the code for me. But if you're like, I
don't even want to
be in charge of deploying it, I
think there's this thing
called Replit or Bolt where you can just
go and describe an app you want to build
and the agent will build and deploy it
for you, and I think that's where things
are heading. Bolt, B-O-L-T, or Replit,
R-E-P-L-I-T. And uh sure, it's not going to
work perfectly, um but I feel like this
is where things are headed, where
you don't have to be a software
engineer anymore to build an app. Mhm.
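The kind of single-purpose personal app described here, like a custom Splitwise, really can reduce to a few lines of logic that an AI agent could generate and deploy. A minimal sketch in Python (all names hypothetical, not from any real product):

```python
# Minimal sketch of a personal bill-splitting app, the "build your own
# Splitwise" idea from the conversation. Splits a shared bill equally
# and reports each person's balance.

def split_bill(total, paid_by):
    """Return each person's balance: positive means they are owed money,
    negative means they owe money."""
    share = total / len(paid_by)              # equal share per person
    return {person: round(amount - share, 2)  # paid minus fair share
            for person, amount in paid_by.items()}

# Asha paid the whole 90; Ravi and Meena owe her 30 each.
balances = split_bill(90.0, {"Asha": 90.0, "Ravi": 0.0, "Meena": 0.0})
print(balances)  # {'Asha': 60.0, 'Ravi': -30.0, 'Meena': -30.0}
```

The "more custom" part is exactly what the speaker describes: unlike a packaged app, you can change the split rule (unequal shares, tips, currencies) by editing one function rather than waiting on a vendor's roadmap.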
And that needs, you know, a little bit of
coding or a little bit of maths maybe?
No. No.
But will I be able to produce an app via
this method which is as good as somebody
who is a software engineer? Not today.
Right. As good as maybe like
um a lower level or lower tier
software engineer? Yes. Mhm. But not as
good as like the good ones
or the best ones. So if I were to
have a kid, I shouldn't send him to an
engineering college to study coding?
I think it still helps to be very good
at infrastructure, backend,
uh data centers, like uh
floating-point arithmetic, storage, all the
core fundamentals are not going away. In
fact, like I would say they're very
essential in a world where AIs are
taking care of the front end and the UI
and um all that stuff because you have
to know where the data lives. You have
to know like how it is stored. You have
to know how it's deployed and you have
to know if a system goes down, how to
fix it, debug it. Those things are still
useful. Mhm. And last one or two
questions. What is the future? If you
were to like predict the next 5 years...
You must have thought of
this. Yeah, I think we'll all have like
a personal assistant. It's going to feel
really amazing. Um, it's not going to be
a luxury thing anymore. Um, it's not
just a thing billionaires had access to.
It's going to feel like an iPhone, where
the same phone that the president
of the US uses, you're going to be able
to use too. And by
that I mean it's going to be pretty
affordable, and uh that's going to make
life a lot easier. Um and
um people are going to be able to build
personalized things for themselves. Um and
there's going to be a lot more creative
expression, like whatever you want to
exist in the world, you can make it
happen. Not everyone in the world
earlier used to be able to make
something happen when they wanted to.
They would use other people's creations.
I think that's going to change and
that's that's going to feel very
utopian. That's the nice part of it. The
dystopian part of it is
uh unfortunately in the short term
there's going to be a lot of labor
displacement.
Uh not as many people are needed to get
work done anymore. Uh and so it's about how
people upskill themselves and adapt.
Uh those who are using AIs are definitely
going to be well
positioned. Um so all that stuff is
going to take place and how people react
to it. It's already like, you know,
you don't need um to build 10,000-person
companies to be a trillion-dollar
company
anymore. So definitely, where are
the next generation of graduates getting
jobs? Existing big techs are laying off
people or like not hiring more. So all
this stuff is definitely going to impact
like the market. And um it's very
interesting that simultaneously, while
creating new value and making software
creation easier, uh we're also
like displacing existing labor and
value. So how people deal with all this
is going to be interesting to watch and
and u I don't think anyone really knows
how it'll all play out.
Will the world be more complicated
if a lot of this power, access, and
the decision making that determines the
path forward lies with one or two
geographies, like is playing out today?
I think
the technologies will be
broadly accessible uh and and the
secrets are not going to be lying in one
or two places and open source will
ensure there's sufficient distillation
to the rest of the world. I think what
won't
be democratized is access to compute
mainly because it takes a lot of money.
Mhm. uh and um that really depends on
which countries choose to invest early
on and later on in the process. Right? I
don't know what to ask you. I
have in my notes that I should ask you
about regulation and the future of that.
I don't know how to... I've read a fair
amount about this,
and how a lot of people think that the
incumbent AI players are trying to
use it as a moat almost, and capture the
regulatory process. Do you have any
view on this? Like, how should regulation...
Let's say the government of India is
listening or watching this show. What
would be the right way for them to
regulate AI? And then B, what is the right
way for America to regulate AI?
Um I mean, I think like regulating
models is not necessarily a great idea.
Uh and it's not going to work in
practice either. Uh people are still
going to be able to download a model and
use it. Uh I think the best way is to
regulate applications. Like uh, personally,
what I feel is pretty
um concerning at this point is
probably people using chatbots when
they're kids and developing like
relationships with them, um and like
feeling suicidal when they don't
get to like enjoy the chatbots anymore,
or they don't respond in the way they
want
to, and
um kind of like taking your
loneliness out on like an AI
chatbot. All that stuff is pretty...
I find it concerning. Like maybe some
people don't and they don't care and
they just think this is not any
different from how the internet used to
be But I think it is different. So
thinking about that application and like
how do we make sure AI usage by kids is
done on apps that you know are
productive and useful and knowledge
enhancing, rather than too
companionship-like, is worth thinking
about.
Um I don't think like other stuff is
worth regulating as much today.
Um and
uh we're kind of still like very
early in AI,
and moving slow is going to cost us a
lot long term, and a lot means like
hundreds of billions or trillions of
dollars. So, it's best
to keep accelerating right now and be
mindful of like use cases like what I
described that are clearly like
dangerous, but otherwise be
pretty open-minded and build stuff and
see how things play out. And I don't
have a different answer to America or
India. I think it's the same answer
here.
Will the world get to a point as it gets
more complicated that we all try and own
our data a bit more? Where, like, let's
assume a model today is scraping data
from across the
internet. Will the world go in a
direction where Indians own Indian data
maybe like another country owns their
data, and every model has to pay a fee to
use that data as an input to train
their models? In the sense, will things
move behind a paywall? Or even if they
don't move behind a paywall, will
there be a
fee? It's possible. Um I think like in
general the internet has
been global and fair use so far I don't
expect it to change
um I think if there are some tokens that
are pretty
valuable, then people might
want some kind of like token payment for
it. Uh it probably won't be on the
internet. That's my
guess. Don't you find that's
happening already, more of it? Like right
now I find so much on the internet which
appears interesting, but it's behind a
paywall. But the question also will be
that, me as an
individual, if I consume behind a
paywall, a model which is then in turn going
to distribute what is behind a paywall,
should they pay the same fee or should
they pay different fees? I genuinely
don't know because like the models are
definitely like training on the content.
So
those who are training
foundation models, they're not just like
consuming the content once. They're
actually like distilling it so they
never have to consume it
again. So it's a different kind of
consumption to a human just reading an
article,
right? But even when I read an article,
I consume it once and distill it. Yeah. But
like your memory and the model's memory
are not comparable,
right? But I'm not distributing it.
But kind of like, you might
share the article with someone else, like
say, hey, did you read this news? So you're
attributing to it, or you're going to
use the wisdom you learned from it in some
manner. Um I mean, in Perplexity, that's
why we attribute it to a source,
like we don't like say it's our
content, and that way we give credit to
the source, and we're not actually
training on the data. But ChatGPT is
different, they actually train on
all the data. Right, okay, last question,
Aravind, because I'm feeling so left out in
all of
this. Do you think it might be possible
for me to come be an intern, work for
maybe 3 months at Perplexity free of
charge? Well, you're uh way too
accomplished for that. But uh... No,
but I'd love to like this is genuine.
Like I feel like I'd love to come live
there for a couple of months, learn some
stuff, and come back cuz I do feel like
I'm not learning enough right now. I
mean, we'd be very honored to have you.
And um I think um I'm not joking. No,
I'm just going to like be there in the
next 30 days maybe. Sure. Every day.
Would love to host you. Uh I guess I'd
just
say, I love the spirit of how you're
like uh having this learner mindset. I
think it's very inspiring and
refreshing. So I don't think there's a
lot you're missing out on. The internet
has pretty much everything out there.
Uh, and the world is running super
fast, so that physical access matters
way less now. I think it's more the
amount of time you get to spend yourself
with an AI model using these apps,
understanding where they fail and uh
talking to the best people.
But interestingly, like X has all of
them literally talking all the time real
time. It's uh pretty nuts. So uh it's
not so much learning
from the model, but being around people
who know what they're learning, or
who are learning what should be learned.
Yeah, definitely it'll be inspiring, like
very refreshing, to spend time
and get a sense of the feel.
Super. But thank you for doing this and
thank you. Uh you're going to be in
India soon. So if I'm not there I'm
going to host you when you're here in
India. Yep.
Done.
