The Biorevolution Podcast

Transcript

Back to episode

00:00:04: The Biorevolution podcast.

00:00:06: Your hosts.

00:00:07: Luise von Stechhoch.

00:00:08: And Andreas Reuchler.

00:00:13: The Biorevolution podcast, this time into the dark, finding meaning within the depth of our podium.

00:00:20: This time we have a very, very interesting guest.

00:00:24: He comes from the US, slash India.

00:00:26: He comes from India originally, but has been in the US for a long, long, long time right now as an academic, as a scientist, as a researcher.

00:00:36: And as always easy, we start with a quote this time from our guest

00:00:42: from

00:00:43: a book yet to be published.

00:00:50: Yes,

00:00:50: the book

00:00:51: is called Eclipse Horizon by Sotakaran Prabhakaran.

00:00:55: And I think it will be out in February.

00:00:58: And in this book, I think he really captures the essence of what our podcast is about pretty well.

00:01:04: which is this really revolutionary potential of biotechnologies that span from potential to really save our species from extinction towards really changing humanity in a very profound way and also leading to the understanding of what it actually means to be a human in a biological sense.

00:01:25: In this book we start in the year two thousand one hundred and eighty four.

00:01:33: Humanity stands at the precipice of an evolutionary divergence.

00:01:36: The old earthbound lineage has reached its limits, trapped by the constraints of biology, physics and an increasingly inhospitable planet.

00:01:46: What began as a threat, a combination of climate collapse, resource depletion and the looming specter of planetary extinction has birthed a radical solution.

00:01:56: the conscious compression of our genome, the shift from Homo sapiens to a newly optimized species.

00:02:03: Homo minimus is underway.

00:02:05: I think this is a great start to a book, makes me want to read more and I was lucky enough to actually do read more and it continues being a very very good read.

00:02:15: We will of course link this book in the show notes.

00:02:18: And now let me quickly introduce our guest before we dive right into this really, really interesting topic.

00:02:25: Dr.

00:02:25: Sudhakaran Prabhakaran is an associate teaching professor at Northeastern University, and he's also the CEO of a biotech non-exomics that works at the intersection of computational biology, AI-driven drug discovery, and the dark genome.

00:02:41: And there he really wants to explore the potential of groundbreaking genomics technologies and proteomics technologies for finding novel therapeutic hypothesis.

00:02:54: And as I introduced, he is also the author of the upcoming book, Eclipse Horizon, which takes this journey through the dark genome, but goes into the questions of evolution of the future of our species, of the future of our planet.

00:03:10: And might I say even the meaning of life a little bit.

00:03:15: Let's dive into this discussion and maybe start a little bit small because now I really laid it on pretty heavy in the beginning and maybe we can take smaller steps towards approaching all these exciting topics.

00:03:28: So we talked about it before, right?

00:03:30: There are a lot of parts of our genome that don't make sense to us as researchers and there has a term been coined for it in the nineteen seventies, which is junk.

00:03:41: which I think was debated already in the scientific community as maybe not being the most scientific term for parts of the genome that we don't understand.

00:03:50: And I think it was also meant by the guy who phrased it more in a controversial way just to spark interest, to spark debate probably.

00:03:59: But it was picked up, I think.

00:04:00: So for me, that was a common term when I studied biology that we have junk DNA.

00:04:06: this junk DNA, maybe we could classify as the parts of the genome we just couldn't make sense of, right?

00:04:12: Just the parts that do something, maybe they don't do anything, and maybe they don't have any meaning whatsoever.

00:04:19: And this really sparked the debate already of functionality.

00:04:23: So why do we have so much stuff in our genome that doesn't make sense?

00:04:28: And there are a number of cues of why maybe this large part consider it maybe ninety-eight percent of our genome that don't make sense.

00:04:40: The question of why would we actually carry along such a heavy load of junk?

00:04:47: That really doesn't seem to make too much sense, right?

00:04:50: So the question is, is it really all junk or is there some hidden meaning in it?

00:04:55: And I think this is something that we want to dive in today.

00:04:59: And maybe I can hand over Sudhakaran to you.

00:05:02: So where would you say are the most, I mean, in this trajectory of classifying, we have DNA, it does something, we have a hypothesis of how much of it is meaningful.

00:05:16: Where would you say are the maybe the most surprising events, the most surprising findings over the last decades where you felt like the field went through a shift?

00:05:26: First of all, thank you, Luis, for inviting me to your podcast.

00:05:29: And thank you, Andreas.

00:05:31: as well.

00:05:31: It's great and it's a honor to be here and I've been following your podcast covering biotech research findings and excitement in the field.

00:05:38: It's an honor to be here.

00:05:39: So to answer your question, a lot has to happen and you probably know as well as I do because you've been in the field working as a PhD student in postdoc as well, right?

00:05:48: So it started with a human genome project which culminated in two thousands.

00:05:53: When we finished the first draft of the human genome, as humanity, we realized that there are around twenty thousand to forty thousand genes.

00:06:01: But initially prior to the sequencing, it was estimated that we should have at least hundred to two hundred thousand genes.

00:06:08: A bit more complex than other organisms that we have sequenced like Drosophila and Ease, which are like smaller organisms.

00:06:14: But the puzzle was like, you know... Why do we still have the same number of genes as those organisms, right?

00:06:20: And then from two thousand to two thousand twenty up until now, a lot of people started looking at to, you know, what is this doing?

00:06:29: Are these other regions that were, as you rightly said, Stephen Ono, I believe, in nineteen seventies coined the term junk DNA.

00:06:35: Are these really junk?

00:06:37: Are they doing anything at all?

00:06:38: Then there was this big consortium of academic labs and Organizations that came together under an umbrella called the ENCODE project, and they systematically curated, like, what are these other regions doing?

00:06:51: And they noticed that, and they kind it with an interesting term, they called it biochemically active.

00:06:56: They said the rest of the genome is also biochemically active.

00:06:59: And it means a lot of things to a lot of people.

00:07:02: For example, there are mutations that affect other proteins, other genes, the known genes rather.

00:07:08: There could be regions that are opening up and closing.

00:07:11: enabling and disabling gene expressions and there could be regions which are basically repeat regions like multiple repeat regions and there could be.

00:07:20: there are regions which are actually making RNAs but not proteins so they were called as non-coding RNAs and non-coding RNAs blah blah blah.

00:07:29: and then there was this other fear which emerged from that effort where they noticed viruses being embedded in those regions, and that itself is a different, complete fail.

00:07:40: So around twenty fourteen, like when we started observing that these other regions, the rest of the ninety eight percent, can also make proteins.

00:07:48: That was the thing that we personally got excited in.

00:07:51: That was during the end of my post-rotler days, and we kind of, few other labs were also reporting that the human genome has this capability pervasively to make proteins, and that's kind of started off.

00:08:04: whatever I did in the last ten, fifteen years.

00:08:06: So proteins, RNAs, viruses, repetitive treatments, everything is biochemically active.

00:08:12: And I think one thing that also struck me as really exciting is that parts of the genome can just jump around.

00:08:18: That is interesting.

00:08:20: That we thought is something that only happens in bacteria and then maybe in plants and then it turned on.

00:08:25: happens in humans as well, and more than we thought.

00:08:28: And this, of course, also has some profound implications, right, that our genome is not as stable as we thought.

00:08:35: So it's not this, we think of the genome as this like, you know, like ancient text in a library that can only be touched by like the special librarian with the white gloves and no one can do anything about it.

00:08:47: And it seems more like a Word document where stuff can be like kind of shifted around and This is actually kind of interesting.

00:08:55: Very well said, yeah, exactly.

00:08:58: So from the seventies when we kind of addressed, when I say V, I'm kind of referring to humanity as such, kind of referred to these regions as junk, then there were these ideas, as you mentioned, some of these repetitive regions were actually moving around.

00:09:12: And that sparked a big debate.

00:09:14: There was somebody called Richard Dawkins who wrote this book called Selfish DNA, where he kind of put in this thought that they're probably existing just to copy themselves.

00:09:24: They're not doing any favor to host genome.

00:09:28: And that's primarily the primordial reasons why we are all here, because our need to copy, whether it's a small repetitive genome or a big genome, that's probably what happened after that.

00:09:50: May.

00:09:51: I quickly jump in first off must confess.

00:09:53: I'm really happy.

00:09:54: It's not two thousand one hundred and something yet because I have no interest whatsoever to become all more minimalist.

00:10:02: So

00:10:03: maybe you will be much optimized.

00:10:05: Probably.

00:10:06: Yes.

00:10:06: But I'm not so looking forward to that.

00:10:09: I rather stick to who I am.

00:10:10: I like it.

00:10:11: But if you want to go to Mars or beyond Mars,

00:10:14: I don't.

00:10:15: Yeah,

00:10:16: we will.

00:10:16: We will a little bit.

00:10:17: I think this is this is a really Really, really interesting thought experiment that I think we have to dive into a little bit at the end of this podcast.

00:10:27: Once we discussed like the hard science, I think we'll get there.

00:10:30: But as a non-scientist on the table, this was my initial point.

00:10:34: Listening to you, I must say that I would be very interested to know a little bit more about where do we stand right now.

00:10:42: So if I look at the timeline around the year two thousand, we thought we figured it out finally.

00:10:50: Then a little bit later, we find out junk is not that junkie after all.

00:10:56: So we're in a process.

00:10:58: where would you say with these mighty plus percent, where do we stand right now?

00:11:03: And what is your thinking?

00:11:05: What is your take about?

00:11:06: What is about to come?

00:11:07: That's a very interesting question, Andreas.

00:11:09: In fact, it's quite scary.

00:11:11: I'm living in the Boston area.

00:11:13: As you probably know, every week there's a new company with a new technology that's emerging.

00:11:18: A lot has happened since two thousand.

00:11:20: You know, it used to cost us around two billion dollars to sequence a single genome.

00:11:24: Now we are talking about a hundred, two hundred dollars, right?

00:11:27: Yes.

00:11:27: And then we are also talking about computational power.

00:11:30: We are no more doing it in our systems at our home.

00:11:33: We are doing it in cloud.

00:11:34: We are talking about we are in gendered AI space now.

00:11:37: A lot of genomics is now almost in gendered AI space.

00:11:41: We figured out the structures of proteins of almost all the proteins that is ever known using Alpha Fold and ESM Fold and a lot of other machine learning slash gendered AI tools.

00:11:52: It's quite scary to even predict what's going to happen in the next six months.

00:11:56: to be really honest and I don't know where it's going.

00:11:58: And a lot of my colleagues whom I kind of talk to I invite them to present in my even sessions.

00:12:05: They are also kind of in the same boat.

00:12:07: They don't know where it's going.

00:12:08: But I think the world will be a lot different from where we are right now.

00:12:11: I think that's for sure.

00:12:13: So what we're discovering are more and more regions that might have functional relevance also through just sequencing more of the genome and having more insights by AI.

00:12:27: which of these variants of the genome might be functionally relevant?

00:12:31: and I think one thing that I think we didn't discuss yet but that we'll get into is the question is when is it functional?

00:12:39: because not everything is functional in everything.

00:12:41: context, because it doesn't have to be.

00:12:43: But biology is like a super, super complex map of interactions of buffering of network of feedback loops or whatever.

00:12:53: So you will have functionality under a certain condition under a certain in the morning, but not in the evening.

00:13:00: If you're standing up, but not if you're sitting down, if you're a man, but not if you're a woman, if you are at but not if you're a twenty, if you had sugar, if you didn't have sugar.

00:13:09: And I mean, this is a very very, very simplified version of all the other things that can go on.

00:13:15: And this means functionality is not a yes-no status.

00:13:19: It's a status of under which conditions does it have a function.

00:13:23: And I think here this makes it so complicated to say something is functional or non-functional because there are conservative people who say like only eight percent are or ten percent of the genome are functional because there if you take them out, everything would collapse kind of.

00:13:42: But again, then the question is under which conditions would it collapse, which brings us to our thought experiment in a second.

00:13:48: But what I think we are discovering is that we have a number of different regulatory elements.

00:13:53: We have these viral remnants, these herbs and herbs, these elements that used to be viruses somehow integrated into.

00:14:02: our DNA and with our, I mean, also the evolution that came before humans.

00:14:07: So it's not human specific.

00:14:08: There are a lot of remnants of viruses such as retroviruses that actually infected our ancestors at some point and then just became part of our genome.

00:14:18: For example, the placental gene for the mammal origin actually seems to be at least influenced or part of a retrovirus.

00:14:27: So here, I mean, there is a really these viruses interact with us, not all of them, but some of them can actually also be reactivated.

00:14:35: So we have these guys, we have a lot of RNAs that have many different functions.

00:14:40: And when I studied in two thousand two, I think the micro RNAs and as RNAs were pretty freshly discovered in the late nineties.

00:14:51: And I was so fascinated that there was a completely new molecule being discovered that no one had ever heard of.

00:14:58: So I'm wondering, Suthakaran, do you think there will actually be new molecules that we don't know about, that we are seeing emerging from, I mean, maybe not completely new species, but new species of RNAs or peptides or something that we haven't heard about or something completely different that will come out of the dark genome, maybe?

00:15:17: Thank you, Luis.

00:15:17: You're actually a good segue into what we've been working on all these years.

00:15:21: I can explain that in that context.

00:15:23: So as Luis mentioned, function means a lot of things to a lot of people.

00:15:27: For us, function means the ability to make proteins, right?

00:15:30: So what we have seen and a few other groups as well have noticed is that we have personally systematically curated and cataloged two hundred fifty thousand additional set of proteins on top of the twenty thousand that is known and seen and investigated by every other group.

00:15:46: This additional two hundred fifty thousand proteins we have in the last couple of years mapped their structure and inferred their function and inferred their dysfunction and different diseases and also investigated whether they can be modulated.

00:16:00: by different mechanisms and cure those diseases.

00:16:03: So that is all known and true information that I'm just giving you.

00:16:08: But in addition to that, computationally, we have predicted there are at least two million such proteins in the human genome.

00:16:15: And as Louise correctly mentioned, they're not all the time being made.

00:16:19: And they're being made under specific conditions or specific physiological state.

00:16:26: For example, different periods of growth, different disease conditions, different tissues, different statuses and situations.

00:16:33: The way I look at it is that the twenty thousand or so known proteins is something like housekeeping genes, like something that is necessary to make us all.

00:16:42: But then there's a difference between different individuals and different organs.

00:16:47: respond to different kinds of stress and for that they can't just rely on those twenty thousand genes.

00:16:51: so they have to make new things like think of genome as a as a Lego.

00:16:55: you just take bits and pieces of Lego and make a new protein and then the new protein does a new function.

00:17:02: now you can ask why are we not finding them all the time?

00:17:05: probably after the stress is gone this protein disappears.

00:17:09: you know doesn't need it.

00:17:10: for example What we have shown in the evolutionary analysis is that newly evolving regions of the human genome that are called accelerated regions that are not even present in primates, they make proteins and those proteins are very strongly associated with schizophrenia, bipolar, those kind of cognitive dysfunctions, which indicates that there are some new proteins that are being made and specific for human cognition and probably they are disrupted in cognitive diseases like... Schizophrenia bipolar, psychotic diseases.

00:17:42: So it kind of puts in all these things together and we have seen this in other organisms too, even fishes.

00:17:48: We have shown that speciation occurs not by changes to the existing known set of proteins but by making new proteins that is needed for that specific environment.

00:17:58: And I'll come into that example in a bit more specifically giving you what that example is.

00:18:03: So that's pretty much what I want to explain to you that functionally we are seeing proteins being made throughout the human genome.

00:18:10: In fact, a lot of these long non-coding RNAs that we have claimed that they are not making any proteins, we have shown that and other people have also shown that they're making proteins.

00:18:20: Yeah.

00:18:20: So some of them are not the full-length proteins that three hundred amino acid proteins that we are usually used to seeing.

00:18:26: There could be thirty amino acids, four hundred amino acids, you know.

00:18:30: Why is it that we're only now seeing these proteins?

00:18:34: Why didn't we capture them in the human genome project back then?

00:18:38: Thank you for asking.

00:18:39: I get this asked all the time.

00:18:40: It's not that we are just making it up.

00:18:42: It's just that.

00:18:43: let me let me give you another metaphor.

00:18:45: Let's say we are flying at forty thousand feet in a Boeing three seven three seven right over the Himalayas and we are able to see only the top eight peaks right.

00:18:55: and then as we come down to twenty thousand feet we are seeing other peaks which are less than eight thousand meters.

00:19:01: and then as you keep landing we are seeing smaller valleys and then small houses and small trees and small stones.

00:19:07: So the reason why we didn't see is primarily because we are not looking for it.

00:19:11: Our technology didn't allow us to look for what the entire genome is capable of doing.

00:19:15: What is it doing?

00:19:16: We were just seeing what we wanted to see.

00:19:18: For example, currently if you ask how proteomics experiment is done, people look at a set of whatever they're collecting mass spec data or ribosome data.

00:19:28: they're only looking at a known set of proteins that they have already identified and then seeing the signatures.

00:19:34: okay is this protein being expressed in the cell line?

00:19:36: is this protein being expressed and they take off those boxes but they discard around sixty seventy percent of information that they don't know what to do with it.

00:19:44: either the too noisy because the technology was not that sensitive or they don't know what to do with it.

00:19:49: it's it's been always there in fact a lot of information that we collect.

00:19:53: It's been already collected by other groups.

00:19:55: We just take that information and re-analyze it.

00:19:58: Ten years ago, we couldn't do those algorithmic crunching.

00:20:01: that we are able to do right now.

00:20:04: And more data is becoming available because the sequence it costs is, as I said, come down from two billion dollars to a few hundred dollars.

00:20:11: So all of this has enabled us to dig deeper.

00:20:20: Could you explain briefly what the role of ribosome sequencing is and how that works like in a way that also maybe our non-scientist listeners might glimpse why this is actually important?

00:20:32: So thank you.

00:20:33: So the ribosome is a machinery where the RNAs bind to the ribosomes and they make proteins, right?

00:20:40: So ribosome sequencing is nothing but a methodology where as the RNA is binding to the ribosome, it's fixed.

00:20:48: in some using some chemical induction and then we pull out those ribosomes and sequence whatever that is bound to the ribosome the RNA that is bound to the ribosome.

00:20:57: Here we are kind of taking a big leap faith assuming that whatever that is bound.

00:21:01: RNA that is bound is getting translated into proteins.

00:21:04: So the actual term for that is translate home not proteome because we're just inferring whatever is bound to RNA.

00:21:10: ribosome is getting converted into proteins.

00:21:13: But then there's another technology which I'll come to, it's called mass spectrometry, where we are actually directly sequencing the proteins and not relying on whether they are binding to ribosome or not.

00:21:23: So there's a kind of two complementary methodology that we always use to identify these so-called dark proteins.

00:21:29: What does a dark protein mean to you?

00:21:32: How would you classify what a dark protein is?

00:21:36: It's a very interesting question.

00:21:37: So that protein for us means simply proteins that have not currently annotated.

00:21:42: So what I mean by annotation is at the moment there are around twenty thousand or proteins that are coming from what the so-called known genes.

00:21:49: So the problem with this known gene concept is that it has been set in stone, meaning thirty to forty years ago.

00:21:57: they said a gene should look like this.

00:21:59: A gene should have a certain length and it should have a certain structure like a start site, stop site, multiple exons, exon boundaries, intron boundaries, stop site, poly A tail, and all of this has made us to look for these genes and proteins that are coming from these genes.

00:22:16: It's like saying, okay, a car, when I say a car, we think of like a sedan car, right?

00:22:21: Like with four doors, four wheels and a steering wheel.

00:22:25: But if I say a car can be like a Cybertruck, like Tesla's Cybertruck or...

00:22:29: This is not a car.

00:22:30: Yeah,

00:22:30: it's not a car.

00:22:32: Not a truck, right?

00:22:33: So we're kind of forced to rethink what it is.

00:22:37: So for us, these proteins, proteins that are coming from these unconventional regions is what we call as dark protein.

00:22:43: It could be shot in land.

00:22:44: It could be coming from... alternate regions.

00:22:47: so just to get in depth of this protein is actually encoded in what is called as one frame of the genome meaning every three nucleotide quotes for a codon and twenty codons make a protein right.

00:22:58: but then if you shift one codon you get a completely new sequence what we call as alternate frames or frame two and likewise if you move one more codon you get another sequence of proteins.

00:23:10: so it turns out that you can actually encode three different kinds of proteins in the same genome sequence.

00:23:16: The interesting part is viruses and bacteria have been clever and they've been using this forever to make, you know, encode as much pack information as much as they can.

00:23:25: But in humans, we have known that this happens but we even investigated.

00:23:30: So we did that and few other, you know, recently people have also done that.

00:23:33: It turns out that almost every gene that we have encodes proteins in all frames and that expands the protein numbers that we know of.

00:23:41: and that is actually the premise of this book.

00:23:43: like coming back to this concept of how do we encode that information.

00:23:48: Would you say we need to update our based on what you find the idea of what a gene is?

00:23:54: Yes, we have been calling for a redefinition of a gene since twenty sixteen time.

00:24:00: There are academic concessions.

00:24:02: Also, thinking on those lines, standardization of what a gene should be.

00:24:05: And a lot of people are now aware of that, and we are continuously, you know, big organizations like Sanger and other organizations in the US, Brood Institute, are all involved in such efforts right now.

00:24:17: It took us a long time to show that these so-called dark proteins are true, and they are stable, and they are functional, they are involved in diseases.

00:24:25: They're not just biological

00:24:26: noise.

00:24:26: They're not an artifact, or noise.

00:24:28: Yeah.

00:24:37: But then again, if you have a new definition, then I look at computational power that is doubled in maybe one year's time or so, you have new findings.

00:24:49: have to

00:24:50: redo, redo

00:24:51: again probably.

00:24:52: But there is a lot of unknown stuff.

00:24:55: So I mean, it took us ages to get to the two percent.

00:24:59: And now we have ninety eight percent left that still need to be deciphered.

00:25:03: So I think the computational power can increase a little bit more to make sense out of the genome.

00:25:08: I mean, the thing is that the genome is not as we thought like something that only as Sudhakaran just introduced.

00:25:16: I mean, it doesn't only read one way, but we have I mean, we have multiple frames to read it.

00:25:21: So basically, it's like a book that's written in, it's not written in multiple languages, but it has hidden messages within the sentences that it has.

00:25:31: And then there are regulatory elements that have influences over long stretches of DNA that do something because, I mean, DNA is coiled around and very nonlinear in a sense, despite the fact that we have been made to believe that it's a very linear flow of information because we have this strong double helix image and this, okay, now we have the meaning.

00:25:56: And then it became very messy.

00:25:58: Which actually brings me to a question, Sudhakar, and if I may, do you think there is the way that we think about genes, that we think about the genome, the dark genome, is influenced by the belief system of researchers at the time to an extent?

00:26:12: It's an interesting question.

00:26:13: There are evidences, right?

00:26:14: People are going by what they're observing and experimenting.

00:26:18: The thing that you mentioned, this whole field of three-dimensional architecture of the genome itself, how it is regulating the expression of genes and giving rise to the complexity that we know of, right?

00:26:29: So I didn't even touch upon that.

00:26:31: That is another huge goldmine.

00:26:33: So for different people, it's a different thing that they're passionate about.

00:26:36: And I think for us, for us investigating these proteins that we are definitely seeing because we have tagged them, we have knocked them out, we are seeing a phenotypic observation if we knock them out.

00:26:46: So we are excited to know what are these twins doing.

00:26:49: So that is kind of the world that we live in.

00:26:51: Yes, we are aware that there are other developments and exciting areas, but for us, this is the part that...

00:26:58: Probably, I mean, the thing is there is so much going on, you cannot focus on all of it.

00:27:11: Maybe we can dive a little bit into the more, how you say, I mean, not philosophical, but into these parts of what does this all actually mean?

00:27:20: Because I think this is what I find really fascinating about your book.

00:27:24: I have to say sometimes when there are multiple pages with the calculations, I'm not following one hundred percent, but I know what the outcome is.

00:27:35: But I think this idea of... evolution as information theory that you introduce, which I find really fascinating.

00:27:45: Maybe you could introduce a little bit what your view is and also maybe what other researchers are exploring in terms of, we started right with this question, why are we carrying all this evolutionary baggage with us?

00:28:00: Because other organisms that are less complex arguably than humans have partly more condensed genomes maybe don't have as much repetitive messy things going on and I mean it takes a lot of energy right to replicate that in each of our many many cells every time that a cell divides and during the formation of the embryo.

00:28:25: And what one also has to take into account, we have very elaborate systems to constantly scan and repair our DNA in order not to cause mutations that we don't want to avoid cancer or other issues.

00:28:38: So we're really investing a lot in keeping a lot of DNA going that we might not really need.

00:28:45: So what's going on there?

00:28:47: Thank you.

00:28:48: You're asking me to give away my secret of the book, but I'm happy to do it.

00:28:52: You can

00:28:52: hint towards it.

00:28:53: Maybe you can do the calculations for us.

00:28:56: There's no hidden message or it's not a mystery book.

00:28:58: The premise that is set in the book is that the genome has this adaptive potential.

00:29:03: What is adaptive potential?

00:29:05: The ability for it to make a new gene and new protein whenever and as and whenever it needs.

00:29:11: So this is not in a theological sense, but it's in a systems biological sense.

00:29:15: It's better to be adaptive.

00:29:17: When I used the word purpose, not in a theological sense, but systems biological.

00:29:21: how much information that you can actually put in such a way that if I transmit a piece of DNA, I've explained why DNA is used as a method to transmit information, but then if we go beyond that, if you accept that, if you go to Mars, if you go to Neptune, if you go to Andromeda, if you go to the center of Milky Way Galaxy, it's all going to be different environment, right?

00:29:40: But you have only one stretch of DNA, and how do you make that such a way that this particular set of DNA is going to thrive in every place?

00:29:48: It's going to make a set of proteins that is going to make an organism in that place.

00:29:52: It's going to make an organism in Andromeda.

00:29:55: All of these organisms should flourish and multiply, create diversity, and all of that.

00:29:59: So that's where this book starts with.

00:30:01: The ability to, why do we have, you know, if you really look at it, there are twenty thousand genes that we know of, comes from forty to sixty million base pairs.

00:30:10: But we have three point two billion base pairs.

00:30:13: So what are the rest of the things doing?

00:30:15: not only for us, but for all other organisms, it's just that it has this adaptive capacity to encode environmental information such that when it encounters new environment, it makes new proteins that enable it to survive.

00:30:29: The simplest information, that's about it.

00:30:32: And how minimum that should be is what we have, you know, I wouldn't say the equations and calculations in the book are perfect and final, but it's a sitting process link, you know.

00:30:42: Okay, given these constraints, this is what with that equations, what I calculated was we can reduce three point two billion base pairs to one point two billion base pairs.

00:30:50: So it's like thirty seven percent reduction.

00:30:52: And that is just to escape solar system, just our solar system.

00:30:56: Now, if you want to go beyond that, there are some equations.

00:30:59: that tells you what needs to be changed.

00:31:01: So that's the kind of thing.

00:31:02: How much you can reduce and how much you can use, you need to have the adaptive potential still retained.

00:31:08: So that is the whole point.

00:31:10: I find this really fascinating because this sounds very science fiction, but in a simpler system, like in bacteria, microplasmum, genitalium, I think it was, and also then later in E. coli, researchers at the Craig Vanter Institute.

00:31:25: Vanter is one of the genome project guys who was in the race to sequence the first genome, and I think was also the first, but I mean, it didn't matter in the end.

00:31:33: I

00:31:34: mean, it does matter because they have contributed to the sequencing.

00:31:38: Yeah, they did.

00:31:38: They did a lot.

00:31:39: And they actually went into this idea of compressing the genomes of microbes.

00:31:45: And these are the first synthetic microbes that have this very limited, this minimum set of genes that they need to survive.

00:31:53: So this idea of moving that from the smallest possible organism to us as humans is, of course, from a thought experiment really, really, really fascinating.

00:32:05: How would you say along?

00:32:07: I mean, you mentioned also speciation and the role in other species.

00:32:11: Would you say the dark genome is what evolves more than the light genome?

00:32:18: Let's put it like that.

00:32:19: Would that be a statement to make or is that pure speculation that I'm throwing out here?

00:32:25: You're spot on.

00:32:26: I need three or four more podcasts.

00:32:28: Podcast series to talk about this because this is an exciting area of research and I'm passionate about it.

00:32:33: That's what we say and that's what.

00:32:34: because the problem is, let's say, you know, if you've been noticing the recent press releases by NASA and other organizations, which brought back samples from Benoy and few other asteroids, they have been seeing tryptophan and few other amino acids, right?

00:32:49: Like pretty much fifteen out of twenty, I believe.

00:32:51: Amino acids have been already found.

00:32:53: in asteroids which are more than four billion years old, like beyond Earth's time, right?

00:32:58: So all this is just showing that, you know, the primary constructs have been there.

00:33:02: So what we are saying is that the random set of DNA, we don't call it genome, the random set of DNA is constantly exploring the environment and making proteins as they need.

00:33:13: And that's where when they see a new protein is very important for its function.

00:33:18: And it's fixed, meaning across multiple eras of evolution, it keeps those proteins.

00:33:24: Whereas those that are not needed as much, they're just kind of throwing it out or making a new product out of it.

00:33:30: So this constant process is going on like testing what is needed, testing what's not needed.

00:33:35: And that's why we have come to this twenty thousand.

00:33:37: And those twenty thousand are fixed and you know, we kind of are essential for us to function.

00:33:41: And then probably we can't tinker with them too much, right?

00:33:45: evolutionally they are fixed but whereas the other regions which are the most important ones because they're still exploring, they're still figuring out what is needed to take these guys to the next stage.

00:33:55: We need new.

00:33:58: oxygen levels are different, carbon dioxide levels are different, can we change something to make these guys more adaptable?

00:34:03: So there's this process going on but they can't touch the known.

00:34:07: So less evolution is going on there.

00:34:09: Although I wouldn't say there's no evolution going on there, but less evolution than in the so-called dark genome.

00:34:15: Because there's more flexibility to do things there.

00:34:18: Absolutely amazing.

00:34:20: And my mind is crossing right now.

00:34:22: First off, of course, we can continue this conversation in the second or third podcast because...

00:34:29: Yeah, I think there's more to explore.

00:34:31: Absolutely.

00:34:32: And it's so fascinating.

00:34:34: You know which image is crossing my mind right now?

00:34:37: A scientist two and a half, three thousand years ago, they looked at the stars and stuff and tried to figure out.

00:34:45: Yeah, and looked at the nature around and tried to figure out how the world works.

00:34:49: Exactly.

00:34:50: And they did.

00:34:51: science and theology and philosophy all at once.

00:34:55: Leaving me to the question, your findings and your... I have a sense of what it means to put together a book like this and how much stress on the one hand side, fulfillment on the other hand side, that means of course, but does this make you, in a sense, I might say humble?

00:35:15: I mentioned this to Louise last time when I met her.

00:35:17: When I started my PhD, microarrays and high throughput studies were becoming mainstream.

00:35:23: We did lots of genomics, proteomics, lipidomics, metallomics.

00:35:26: We got tons of data.

00:35:28: And then at that point, it was so exciting.

00:35:30: You know, we're going to figure out, fix schizophrenia, bipolar, we can cure diseases, right?

00:35:34: And that was twenty-two years ago.

00:35:37: And we're still here.

00:35:38: We still haven't understood anything.

00:35:40: And the more we scratch, the more we see.

00:35:43: And that's where we are.

00:35:44: Yeah, I couldn't agree more.

00:35:46: I think this is so interesting that we thought like, okay, now we read that like after the publication of the Human Genome Project was really like this belief.

00:35:55: Now we know it all.

00:35:56: And as you say, I mean, there is so much we still don't know and that we have to figure out.

00:36:02: And but this is also very cool.

00:36:04: And I mean, I also think it's humbling to know how much there is still to discover and how much excitement lies in there and how much understanding of what it means to be human and how our planet works and maybe how life outside our planet could work.

00:36:19: so I think.

00:36:20: You come too close for today but with understanding that there will be a follow-up I guess.

00:36:27: We'll make sure that we'll put a link to that book, of course, into the show notes, as well as some central findings, as always.

00:36:36: And thank you, thank you very, very much.

00:36:39: It was an amazing journey so far with you.

00:36:42: Thank you so much for inviting me here.

00:36:44: It's been fun.

00:36:45: And thanks for the thoughtful questions.

00:36:46: And thanks, Louise, for reading the book.

00:36:48: I don't know whether you got to finish it, but it's a lot of dense information in the middle.

00:36:51: And I heard that feedback from a few people who read that and gave me that feedback.

00:36:55: But thank you for doing that.

00:36:57: It's really a fun read.

00:36:58: I have to say, for me, it really hits this sweet spot between science philosophy and some like science fiction thought experiments.

00:37:05: So I think this for anyone who likes to think a little bit.

00:37:09: outside the box and about the bigger picture of science.

00:37:12: This is really a good book to read.

00:37:18: This was the Biorevolution podcast.

00:37:21: Many thanks for listening or following us on YouTube.

00:37:25: As always, feel free to comment to say something on Spotify, for instance, or YouTube.

00:37:33: We really appreciate that.

00:37:34: And further information, as always, you'll find on sciencetales.com.

00:37:39: Stay tuned.

00:37:40: We'll be up for... Many, many more episodes.

00:37:44: Thank you.