My previous lab and I recently published a review on the techniques and possibilities of analysing TCR repertoire data produced from high-throughput sequencing.
By and large I'm exceedingly happy with it - apart from a couple of missed references and one very unfortunate mix up regarding the accessibility, I'm very pleased with how it came out (and hope it will prove useful!).
One thing that I'm particularly pleased that we included (in spite of the lack of published descriptions yet) is the pair of manually curated TCR databases that have recently emerged: VDJdb and McPAS-TCR, in which you can find a small (but growing) host of TCRs of known specificity and/or disease association. We thought it was important to get these out there as soon as possible, as this is a rapidly changing field which is currently sorely needing for such efforts at standardisation and resource development.
With that in mind, I've been playing around with both of these, and thought I'd share some of the bare bones of the bash code I've been using to pull out sequences related to epitopes I'm interested in. Here is my quick vignette using VDJdb to pull out HIV-reactive TCR sequences - and even then just the fields of the database I'm interested in - using basically just default terminal commands.
My thoughts on immunology, T-cell receptors, next-generation sequencing, molecular biology, and anything else that takes my fancy.
Showing posts with label VDJ. Show all posts
Showing posts with label VDJ. Show all posts
Wednesday, 1 March 2017
Saturday, 28 January 2017
DNA in different cells, preventing autolysis and foetal cells in vaccine production
Continuing on the
theme from my
last post, here's a selection of my answers to recent questions
that I saw on Reddit which I thought were interesting.
Is every cell [in our body] carrying the same DNA?
"The standard answer is usually yes, apart from...
Is every cell [in our body] carrying the same DNA?
"The standard answer is usually yes, apart from...
• Mutations
are the most obvious differences between cells, which usually
comes up when this question gets asked. This happens in non- and
pre-cancerous cells, but as genetic instability is a common property
of cancer it tends to be much worse in cancer cells. It's also worth
remembering that doesn't just mean the wrong base of DNA at a
position, but can include insertions, deletions and duplications, not
just of a base or two but potentially up to whole or huge chunks of
chromosomes, even fusion between different chromosomes (which can
make fusion proteins with novel functions). This is why you sometimes
see aneuploidy (an atypical number of chromosomes) in some cancer
cells.
• Gametes
(i.e. sperm and egg cells, and their precursors). These germline
cells aren't always grouped in when people ask this question, but
they are definitely 'in our body' so I am! Not only are these cells
haploid (only containing one copy of each chromosome) but during
meiosis (the kind of cell division that produces them) the
chromosomes undergo recombination,
so the pairs of each chromosome will swap bits; this means that the
gametes you produce won't have the same versions of the chromosomes
that you inherited from your parents, but something in between. This
helps keep our gene pool diversified.
• My
particular favourite, as it's what I work on - adaptive immune
cells. There are potentially infinite different kinds of viruses,
bacteria and fungi etc which could infect us and do us harm, which we
need to protect ourselves against. This is pretty hard to do with a
finite, static genome, as the pathogens could quickly evolve around
it. What we evolved is a branch of immunity - our adaptive immunity -
which anticipates this huge diversity of infectious agents and
responds in kind, by pre-emptively shuffling bits of DNA around to
make millions of different receptors, to try to recognise as many
different (non-self) things as possible. This happens in developing
B-cells and T-cells, which is used to make B-cell receptors (which
when released in a soluble form become antibodies) and T-cell
receptors (BCRs and TCRs). This is acheived through a process called
VDJ
recombination, named after the segments of DNA
which get recombined together to form a new gene. This provides the
basis for how our immune systems learn - if you get infected with
something that a particular TCR can bind say, that T-cell will divide
and differentiate, which means that the next time you get infected
with it those T-cells are already in place, waiting to go and fight
it off.
• Microchimerism.
In biology a chimera is an organism that contains cells from more
than one zygote (fertilised egg). This happens in labs lot for
various reasons (which is why you get mice
like the one on the right, made up of cells from black-furred and
white-furred mice zygotes), but it also happens
naturally at some rate (with only a few cells making it
microchimerism). The most common example we know of (at least
for us placental mammals) is foetal
chimerism, where cells from a developing foetus
pass through the placenta and establish themselves - sometimes
permanently - in the mother (which may help prevent her immune system
rejecting the foetus). There are case reports where cells can go the
other way (so say cells
from the mother can be detected in the circulation of her child),
although I think this mostly occurs when one or other seems to have
some genetic condition. There also of course exist actual human
chimeras - anyone who's ever had an organ or bone marrow transplant
will have a large number of cells in which the DNA will be very
different (although hopefully not at the MHC alleles, which mediate
rejection), as it all came from the donor."
Why aren't cytoplasmic granules of natural killer cells degraded by the potent enzymes that they contain?
"There's a number of different
mechanisms by which cytotoxicity is controlled. There's also some
disagreement, and a lot we still don't know (which is always a good
sign that someone's asking a good question!).
In terms of perforin specifically, there's two factors that come
to mind that are probably the best answers to question. Note that
they also all largely apply to cytotoxic T-cells as well as NK cells,
as they use the same basic cytolytic machinery:
•
Perforin requires calcium to form pores and insert to membranes, at
concentrations that are typically only found outside (and not inside)
cells. This means that the perforin should only work once its been
secreted across an immune synapse.
•
It also requires a pH above ~6, in order to adopt the correct
conformation. Stored lytic granules are pretty acidic, which helps
maintain the contents inactive (unless they are release and the acid
is diluted out)
These points are covered pretty well in
this
detailed review, if you're interested. There are a
number of other possibilities - like there may exist certain
chaperones or regulatory
proteins which help keep the perforin inactive, or
that cleavable post-translational modifications may help keep in an
inactive form. That latter one was quite notable, although it seems
opinion has moved towards the glycosylation actually helping guide
perforin along through the ER quickly during synthesis,
to stop it lingering in calcium-rich/pH neutral compartments where it
might do some damage."
[How is the HepA vaccine ethical when it uses MRC-5 cells?]
(NB: I think that this was probably just an anti-vaxxer account trying to colour people's views against vaccines given that it's the sole post from this user in a sub that gets high traffic from anti-vaccine proponents, but it is a valid question that a quick google doesn't produce many good answers for, so I thought it was worth addressing.)
"I'm presuming the ethical problem
you're having is that some people have here is that the vaccine
production involves MRC-5 cells, which are derived from an abortus
foetus?
First off it's worth correcting one
thing - the vaccine won't actually contain MRC-5 cells - it
just uses the cells to grow the virus, which will then be inactivated
to make the vaccine. Remember that viruses cannot grow on their own,
they need to use cells to do so, so it's impossible to make an
inactivated viral vaccine without cells. (It's also mostly impossible
to make protein-subunit vaccines without cells, although you can use
non-mammalian cells like bacteria or yeast in that case.)
However if your issue is with the fact
that foetal cells were used at all, that's slightly trickier, and
your interpretation of the facts may change depending on your
viewpoint.
My view point is that early stage
embryos are not sentient, and certainly not sapient, and aren't
really 'people' as such (there's actually some great discussion on
this in a
thread that came up earlier in /r/biology today, which deals with
this topic very well). MRC-5 cells came from a
14-week old foetus that was aborted for psychiatric reasons,
well before the demonstrably concious stage of development.
Another way to look at it is like organ
donation. If a baby died (for whatever reason), would you think it
was unethical to transplant any organs from that child to others to
save their lives? Despite a tragic thing happening, one, two, maybe
even three other lives might have been saved or prolonged. If that's
acceptable to you, consider that the foetus cells have basically been
donated, more than forty years now, to an effort to protect millions
of people from a horrible disease. Over 188
million doses of Hep A vaccines have been given; as
just under 1%
of infected people would be expected to die, a
rough estimate would be that Hep A vaccination has probably saved at
least 17 million lives (and prevented a great deal of non-fatal yet
horrible disease).
(In fairness I'm not sure how many of
those doses used MRC-5 derived vaccines, but then these cells are
also used in the production of vaccines for other diseases too.)
From a different perspective, even if
you don't accept the evidence that foetuses aren't sentient, even if
you don't care and think that humanity begins at conception, even if
you don't buy the organ/tissue donation analogy, there's a final
pragmatic argument: we have the cells, and they work. If we want to
stop people contracting, suffering or dying from preventable
diseases, we need to use the tools that we have available. Hep A is a
nasty disease, and we have a highly safe and effective vaccine - to
my mind, advising people to not get the vaccine (in the absence of an
equally good alternative) would be a much more unethical
alternative."
Sunday, 4 May 2014
Translating TCR sequences addendum: not as easy as FGXG
I recently wrote a blog post about the strategies used to translate T-cell receptor nucleotides en masse and extract (what can arguable be considered) the useful bit: the CDR3.
In that talk I touched on the IMGT-definition of the CDR3: it runs from the second conserved cysteine in the V region to the conserved FGXG motif in the J. Nice and easy, but we have to remember that it's the conserved bit that's key here: there are other cysteines to factor in, and there are a few germline J genes that don't use the typical FGXG motif.
However even that paints too simple a picture, so here's a quick follow up point:
These are human-imposed definitions, based more on convenience for human-understanding than biological necessity. The fact is that we might well produce a number of TCRs that don't make use of these motifs at all, but that are still able to function perfectly well; assuming the C/FGXG motifs have function, it's possible alternative motifs might compensate for these.
I have examples in my own sequence data that appear to clearly show these motifs having been deleted into, and then replaced with different nucleotides encoding the same. Alternative residues must certainly be introduced on occasion, and I'd be surprised if none of these make it through selection; we just don't see these because we aren't able to generate rules to computationally look for these.
I actually even recently found such an example with verified biological activity: this paper sequenced tetramer-sorted HIV-reactive T-cells, revealing one that contained an alpha chain using the CDR3 'CAVNIGFGNVLHCGSG'.*
For the majority of analyses, looking for rare exceptions to rules probably won't make much difference. However as we increase the resolution and throughput of our experiments, we're going to find more and more examples of things which don't fit the tidy rules we made up when we weren't looking so deeply. If we're going to get the most out of our 'big data', we need to be ready for them
* I was looking through the literature harvesting CDR3s, which reminds me of another point I want to make. Can I just ask, from the bottom of my heart, for people to put their CDR3s in sensible formats so that others can make use of them? Ideally, give me the nucleotide sequence. Bare minimum, give me the CDR3 sequence as well as which V and J were used (and while I stick to IMGT standards, I won't judge you if you don't - but do say which standards you are using!). Most of all, and I can't stress this enough, please please PLEASE make all DNA/amino acid sequences copyable.**
** Although spending valuable time copying out or removing PDF-copying errors from hundreds of sequences drives me ever so slightly nearer to a breakdown, it does allow me to play that excellent game of "what's the longest actual word I can find in biological sequences". For CDR3s, I'm up to a sixer with 'CASSIS'.
In that talk I touched on the IMGT-definition of the CDR3: it runs from the second conserved cysteine in the V region to the conserved FGXG motif in the J. Nice and easy, but we have to remember that it's the conserved bit that's key here: there are other cysteines to factor in, and there are a few germline J genes that don't use the typical FGXG motif.
However even that paints too simple a picture, so here's a quick follow up point:
These are human-imposed definitions, based more on convenience for human-understanding than biological necessity. The fact is that we might well produce a number of TCRs that don't make use of these motifs at all, but that are still able to function perfectly well; assuming the C/FGXG motifs have function, it's possible alternative motifs might compensate for these.
I have examples in my own sequence data that appear to clearly show these motifs having been deleted into, and then replaced with different nucleotides encoding the same. Alternative residues must certainly be introduced on occasion, and I'd be surprised if none of these make it through selection; we just don't see these because we aren't able to generate rules to computationally look for these.
I actually even recently found such an example with verified biological activity: this paper sequenced tetramer-sorted HIV-reactive T-cells, revealing one that contained an alpha chain using the CDR3 'CAVNIGFGNVLHCGSG'.*
For the majority of analyses, looking for rare exceptions to rules probably won't make much difference. However as we increase the resolution and throughput of our experiments, we're going to find more and more examples of things which don't fit the tidy rules we made up when we weren't looking so deeply. If we're going to get the most out of our 'big data', we need to be ready for them
* I was looking through the literature harvesting CDR3s, which reminds me of another point I want to make. Can I just ask, from the bottom of my heart, for people to put their CDR3s in sensible formats so that others can make use of them? Ideally, give me the nucleotide sequence. Bare minimum, give me the CDR3 sequence as well as which V and J were used (and while I stick to IMGT standards, I won't judge you if you don't - but do say which standards you are using!). Most of all, and I can't stress this enough, please please PLEASE make all DNA/amino acid sequences copyable.**
** Although spending valuable time copying out or removing PDF-copying errors from hundreds of sequences drives me ever so slightly nearer to a breakdown, it does allow me to play that excellent game of "what's the longest actual word I can find in biological sequences". For CDR3s, I'm up to a sixer with 'CASSIS'.
Labels:
CDR3,
cysteine,
deletion,
FGXG,
IMGT,
NGS,
recombination,
repertoire,
T-cell receptor,
T-cells,
TCR,
VDJ
Subscribe to:
Posts (Atom)