Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Depends on your definition of small - but I accept the point that if the peptides are small enough it can be useful and I went too far there.

Remember the problem posed was to calculate the mass of proteins upto 1000aa where the difference between mono-isotopic mass and real average mass would be many 10's of daltons - much more than the missing water!



I think the problem is poorly worded, which lends itself to the confusion we experience, unless we know (presumably from the context of the goals of this project) that these physical details are beyond the scope of the project.

That is, I think "useful" here is meant as "useful in learning to program", not "useful in actual mass spectra analysis."

Eg, you write "calculate the mass of proteins up to 1000aa".

There's a couple of picky details.

1) The text says "total weight of [A protein string] P", where "The standard weight assigned to each member of the 20-symbol amino acid alphabet is the monoisotopic mass of the corresponding amino acid".

(I'm ignoring that "weight" != "mass" because in this context those are synonyms.)

It reads like the text defines "standard weight" in terms of the monoisotopic mass, and asks to compute that weight. So the problem is not asking to "calculate the mass of proteins up to 1000aa" but "calculate the monoisiotopic mass of proteins up to 1000aa." It further says "all amino acid masses are assumed to be monoisotopic unless otherwise stated".

(Alas, the explanatory text goes on to say "There are two standard ways of computing the mass", which contradicts the assertion that there's a "the standard weight.")

2) It says "protein string" in the problem, not protein, and clarifies that "In the following several problems on applications of mass spectrometry, we avoid the complication of having to distinguish between residues and non-residues by only considering peptides excised from the middle of the protein."

That is, the problem posed was not to calculate the "[monoisotopic] mass of proteins" but something more like the "[monoisotopic] mass of peptides excised from the middle of the protein, represented as a protein string."

3) In trying to understand this @$%@#$%&%$ topic more, I found https://patentimages.storage.googleapis.com/42/6b/2b/ec3a694... which I believe says that for very accurate mass spectrometers, for large proteins, the most abundant mass may be more useful than the average mass.


It's suppose to be an educational tool - bioinformatics is more than writing programs to add up numbers it's about understanding the science behind it.

So I didn't like the question because it treated the problem as a simple 'write a program to add up a list of numbers, based on a lookup table',rather than address the real science issues around protein mass. ( Here the real challenges actually come from post-translational modifications - which makes mass matching a very hard problem indeed - with lots of challenges for anyone in computer science )


In my top-level coment I asked: How (in)correct are the other answers? I-am-not-a-bioinformatics-programmer.

As a variation, how many other questions do you not like?

http://rosalind.info/problems/hamm/ computes the Hamming distance between two strings, using the simplification that only point mutations are important. This is of course not a true reflection of the science behind comparing two DNA strings.

Do you therefore also not like that question? Which others don't you like, because of simplifications they don't explain?

As an education tool, is it not useful to discard complexity in the process of bootstraping towards the full details?

Ha! I had heard the phrase "lie-to-children" before, which seemed relevant to this thread. https://en.wikipedia.org/wiki/Lie-to-children says:

> A lie-to-children (plural lies-to-children) is a simplified explanation of technical or complex subjects as a teaching method for children and laypeople. The technique has been incorporated by academics within the fields of biology, evolution, bioinformatics and the social sciences.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: