MediaWiki API result

This is the HTML representation of the JSON format. HTML is good for debugging, but is unsuitable for application use.

Specify the format parameter to change the output format. To see the non-HTML representation of the JSON format, set format=json.

See the complete documentation, or the API help for more information.

{
    "batchcomplete": "",
    "continue": {
        "gapcontinue": "Repeat_Library_Construction-Advanced",
        "continue": "gapcontinue||"
    },
    "warnings": {
        "main": {
            "*": "Subscribe to the mediawiki-api-announce mailing list at <https://lists.wikimedia.org/mailman/listinfo/mediawiki-api-announce> for notice of API deprecations and breaking changes."
        },
        "revisions": {
            "*": "Because \"rvslots\" was not specified, a legacy format has been used for the output. This format is deprecated, and in the future the new format will always be used."
        }
    },
    "query": {
        "pages": {
            "35": {
                "pageid": 35,
                "ns": 0,
                "title": "Registering with GenBank",
                "revisions": [
                    {
                        "contentformat": "text/x-wiki",
                        "contentmodel": "wikitext",
                        "*": "This page will contain a stepwise tutorial for registering a prefix for your project with GenBank"
                    }
                ]
            },
            "3": {
                "pageid": 3,
                "ns": 0,
                "title": "Repeat Library Construction--Basic",
                "revisions": [
                    {
                        "contentformat": "text/x-wiki",
                        "contentmodel": "wikitext",
                        "*": "This page describes the process of generating a species specific repeat library suitable for repeat masking prior to protein coding gene annotation with MAKER. This is achieved by a repeat collection tool (RepeatModeler) that collects sequences reaching a certain copy number. The repetitive sequences are then classified based on their similarity to known transposable elements. As a result, low copy number transposable elements are not included in the collection. Moreover, a substantial amount of sequences cannot be classified. For a more comprehensive collection of repetitive elements as well as better classification see [[Repeat Library Construction--Advanced]].\n \n''Content contributed by [http://www.hrt.msu.edu/ning-jiang/ Dr. Ning Jiang]''\n\nBuilding custom repeat library for plant genomes \u2013 Basic protocol\n\n== 1.   Collecting repetitive sequences by [http://www.repeatmasker.org/RepeatModeler.html RepeatModeler] ==\n\nThe genomic sequence  (called seqfile,in fasta format) was processed by RepeatModeler\n\nFirst command:\n DIR/BuildDatabase -name seqfiledb -engine ncbi seqfile\n*DIR = path where RepeatModeler is.\n*\u201c-engine ncbi\u201d refers to the NCBI blast program that was used as the alignment tool. \n\nSecond command:\n nohup DIR/RepeatModeler -database seqfiledb >& seqfile.out\n\n*After implementation of the commands, the RepeatModeler program generates a directory called \u201cRM\u2026\u201d. Inside the directory there is a document called \u201cconsensi.fa.classified\u201d that contains all the repetitive sequences. The definition line of each sequence contains the sequence name and the identity in RepeatMasker format. If the sequence is unidentified, it is marked as \u201cUnknown\u201d.\n*In our study, these with identities were put in ModelerID.lib and these with \u201cUnkown\u201d were in Modelerunknown.lib.\n*Sequences in Modelerunknown.lib were searched against a transposase database (derived from [http://www.repeatmasker.org/ RepeatMasker] package and [http://www.ncbi.nlm.nih.gov/pubmed/21535899 Kennedy et al (2011)]) and sequences matching transposase were considered as transposons belonging to the relevant superfamily and were incorporated into ModelerID.lib and excluded from Modelerunknown.lib.\n\n==2.      Exclusion of gene fragments ==\n\n*All repeats collected by RepeatModeler were used to search against a plant protein database  where transposon protein were excluded. Sequences match the plants proteins (considered as gene fragments) as well as 50 bp flanking sequences were excluded. After the exclusion if the remainder sequences were shorter than 50 bp, the entire sequence was excluded. A package for conducting this task is available at [http://weatherby.genetics.utah.edu/MAKER/data/ProtExcluder1.1.tar.gz here] ([http://weatherby.genetics.utah.edu/MAKER/data/ProtExcluder1.1Manual.docx manual]).\nAfter exclusion of putative gene fragments, ModelerID.lib were considered as know TE sequences (AtBasicTE.lib)\n*AtBasicTE.lib was combined with Modelerunknown.lib (after exclusion of gene fragments) to form AtBasicAllRepeat.lib.\n*It is conceivable that the sequences in AtBasicTE.lib are relatively reliable transposons but this library does not contain all repeats (repeat numbers are underestimated). If this library is used, certain repeats are left out and maybe annotated as genes or portion of genes. On the other hand, AtBasicAllRepeat.lib is more comprehensive  but may contain sequences from novel gene families that are not present in the existing plant protein database, so the repeat number may be overestimated in this library and novel gene families might be masked."
                    }
                ]
            }
        }
    }
}