$accession
$accession : array
Class ParseSwissprotManager
$accession : array
$sequence : \Amelaye\BioPHP\Domain\Sequence\Entity\Sequence
$authors : array
$features : array
$keywords : array
$references : array
$srcForm : \Amelaye\BioPHP\Domain\Sequence\Entity\SrcForm
$gbSequence : \Amelaye\BioPHP\Domain\Sequence\Entity\GbSequence
$spDatabank : array
$aLines : array
parseDataFile(array $aFlines) : \Amelaye\BioPHP\Domain\Sequence\Entity\Sequence
Parses a Swissprot data file and returns a Seq object containing parsed data.
Parses the Feature Table lines (those that begin with FT) in a Swissprot data file, extracts the feature key name, from endpoint, to endpoint, and description, and stores them in a (simple) array.
array | $aFlines | The lines the script has to parse |
$oSequence
buildDRField() : mixed
Parses DR lines Format : DR DATA_BANK_IDENTIFIER; PRIMARY_IDENTIFIER; SECONDARY_IDENTIFIER We assume that all three data items are mandatory/present in all DR entries.
( refno => ( (dbname1, pid1, sid1), (dbname2, pid2, sid2), ... ), 1 => ( ... ) ) ( 0 => ( (REBASE, pid1, sid1), (WORPEP, pid2, sid2), ... ), 1 => ( ... ) ) ( rn => ( "rp" => "my rp", "rc" => ("tok1" => "value", ...) ) ) ( 10 => ( "RP" => "my rp", "RC" => ("PLASMID" => "PLA_VAL", ... ) ) ) Example: DR AARHUS/GHENT-2DPAGE; 8006; IEF.
buildRNField(array $aFlines, array $aReferences, array $aAuthors) : mixed
Parses RN lines - This is a paragraph which contains several lines Example : RN [8] RP X-RAY CRYSTALLOGRAPHY (2.6 ANGSTROMS).
RX MEDLINE; 90008932. RA ECK M.J., SPRANG S.R.; RL J. BIOL. CHEM. 264:17595-17605(1989).
array | $aFlines | |
array | $aReferences | |
array | $aAuthors |
buildGNField( $aGename) : mixed
Parses GN line - GN is always exactly one line.
GNAME1 OR GNAME2 ( (GNAME1, GNAME2) ) GNAME1 AND GNAME2 ( (GNAME1), (GNAME2) ) GNAME1 AND (GNAME2 OR GNAME3) ( (GNAME1), (GNAME2, GNAME3) ) GNAME1 OR (GNAME2 AND GNAME3) NOT POSSIBLE!!! ALGORITHM: 1) Split expressions by " AND ". 2) Test each "token" if in between parentheses or not. 3) If not, then token is a singleton, else it's a multiple-ton. 4) Singletons are translated into (GNAME1). Multiple-tons are translated into (GNAME1, GNAME 2). 5) Push gene name array into larger array. Go to next token.
$aGename |