Ña niyu'u nchu'a

Vertical Tabs

Reader
<?xml version="1.0" encoding="UTF-8"?>
<?xml-model href="http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng" type="application/xml" schematypens="http://relaxng.org/ns/structure/1.0"?>
<?xml-model href="http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng" type="application/xml"
	schematypens="http://purl.oclc.org/dsdl/schematron"?>
<TEI xmlns="http://www.tei-c.org/ns/1.0" xml:id="L146-tok" xml:base="Corpus-MIX-SIL-TEI.xml#L146">
   <teiHeader>
      <fileDesc>
         <titleStmt>
            <title xml:lang="mix"><seg xml:id="L146-00-01"><seg xml:id="L146-00-01-01">Ña</seg> <seg xml:id="L146-00-01-02">niyu'u</seg> <seg xml:id="L146-00-01-03">nchu'a</seg></seg></title>
            <title xml:lang="esp"><seg xml:id="L146-00-02"><seg xml:id="L146-00-02-01">El</seg> <seg xml:id="L146-00-02-02">niño</seg> <seg xml:id="L146-00-02-03">que</seg> <seg xml:id="L146-00-02-04">se</seg> <seg xml:id="L146-00-02-05">asustó</seg> <seg xml:id="L146-00-02-06">mucho</seg></seg></title>
         </titleStmt>
         <publicationStmt>
            <p> This is an XML-TEI markup of the original publications by SIL Mexico for the purposes of:
               archival and for the creation of a body of Mixtepec-Mixtec language resources (LR) (ISO: mix).
               Encoded documents used with the permission of original authors.</p>
         </publicationStmt>
         <notesStmt>
            <note xml:id="L146-resumen" xml:lang="esp">Este cuento relata cómo a un niño se le quitaron
               las ganas de matar pájaros.
            </note>
         </notesStmt>
         <sourceDesc>
            <bibl xml:id="bibl.L146"><author>María Gómez Hernández</author><editor>Gisela Beckmann</editor><editor>María M. Nieves</editor><publisher>Instituto Lingüístico de Verano, A.C.</publisher>
               <pubPlace>Tlalpan, D.F., México</pubPlace><edition>second</edition><date>2008</date>
               Obtained from:<ptr target="http://www-01.sil.org/mexico/mixteca/mixtepec/L146-NinoAsusto-mix.htm"/>
            </bibl>
         </sourceDesc>
      </fileDesc>
      <encodingDesc>
         <p>
            <list>
               <item><label>segment</label>
                  <elementSpec ident="seg">
                     <desc>In this document (and in all others in the SIL collection) the 'seg' element is used to encapsulate:
                        <list>
                           <item>i) the entire sentences from original document (first level);</item>
                           <item>ii) the individual lexical items contained within the sentences (second level);</item>
                        </list>
                        Since the segments are not all orthographic/prose in nature, as many contain mathematical word problems, or completely numerical equations (some contain a mixture of both), 
                        the segments in this document that are not entirely in prose are given the xml-tei attributes: 
                     </desc>
                  </elementSpec>
               </item>
               <item><label>character</label>
                  <elementSpec ident="c">
                     <desc>Within the sentence level "seg" (segment) elements all punctuation characters (".", ";", ":", "¿", "?", "!", etc.) are encapsulated in the "c" (character) element;
                        In future stages of the project, each of these punctuation characters will be formally linked with the linguistic categories with which it is associated:<eg> a sentence that begins with: "¿"
                           (and ends with: "?") can be immediately recognized as sentence type "question/interrogative";</eg>
                     </desc>
                  </elementSpec>
               </item>
               <item><label>division</label>
                  <elementSpec ident="div">
                     <desc>Where the page divisions in the original documents are, this encoding uses "div" (division) elements; this is to separate the linguistic, and knowledge data from formatting</desc>
                  </elementSpec>     
               </item>
            </list>
         </p>
         <classDecl>
            <taxonomy xml:id="tax.sil-mix">
               <category xml:id="pedagogical">
                  <catDesc>PEDAGOGICAL</catDesc>
                  <category xml:id="pedagogical-inter">
                     <catDesc>PEDAGOGICAL:INTERACTIVE</catDesc>
                  </category>
                  <category xml:id="pedagogical-ref">
                     <catDesc>PEDAGOGICAL:REFERENCE</catDesc>
                  </category>
               </category>
               <category xml:id="fiction">
                  <catDesc>FICTION</catDesc>
                  <category xml:id="fiction-fantasy">
                     <catDesc>FICTION:FANTASY</catDesc>
                  </category>
                  <category xml:id="fiction-realistic">
                     <catDesc>FICTION:REALISTIC</catDesc>
                  </category>
               </category>
               <category xml:id="folklore">
                  <catDesc>FOLKLORE</catDesc>
               </category>
            </taxonomy>
         </classDecl>
         <projectDesc>
            <p>
               This portion of the lexicography project deals with the only known body of literature in Mixtepec-Mixtec language which is comprised of series of 34* orthographic children's texts published by SIL Mexico (Summer Institute of Linguistics). The target audience of these documents is primarily young MIX speakers and are designed for use as classroom handouts and/or lessons for primary/elementary school-level. The topical content of these publications contain both culturally specific, and non-culturally specific subject matter supplemented by illustrations. Topics covered in these documents include: vocabulary, mathematics, telling time, geography, seasons, weather, local agricultural practices, fables, local Mixtec legends amongst others. These publications make up the second most important collection in this project and they remain the only source of reference for examples of how the working orthography of the Mixtec language is applied to the Mixtepec variety.
               The encoding of this portion of the MIX Language resources can be divided into the following components:
               <list>
                  <item>i) TEI-XML markup of documents that supports reuse and extension within this project and/or for other interested parties;</item>
                  
                  <item>ii) Annotation and glossing of lexical information and structures in documents; tasks include;
                     
                     <list>
                        <item>glossing of orthographical lexical items;</item>
                        <item>assignment of lexical properties to sentences, phrases and lexical items (morpho-/syntactic; pragmatic; semantic etc.);</item>
                        <item>linking corresponding bilingual items (Spanish-Mixtec) within each document;</item>
                        <item>entry of analyzed, glossed and categorized lexical information into TEI-Dictionary;</item>
                        <item>link orthographical forms of lexical items with phonetic transcriptions of spoken forms;</item></list></item>
                  
                  <item>iii) Classification of Document Types and Ontology Linking;
                     <list>
                        <item>replace current SIL document classification model<ref target="#tax.sil-mix"/> with established and expansible document system which can be used to classify any new document or literature produced in the language in the future;</item>
                        <item>as per the goal of the overall project, the use of language-specific glosses (including 'Interlingua') will be replaced by grounding the semantic profile of each lexical item by referencing ontological entities (from existing ontologies such as DBpedia, SKOS, etc.);</item>
                     </list>
                  </item>
               </list>
            </p>
         </projectDesc>
      </encodingDesc>
   </teiHeader>
   <text decls="#fiction-realistic">
      <body>
         <div xml:id="L146-01">
            <head><graphic url="L146-01.png"/></head>
            <p>
               <seg xml:id="L146-01-01"><seg xml:id="L146-01-01-01">Iin</seg> <seg xml:id="L146-01-01-02">kii</seg> <seg xml:id="L146-01-01-03">ntsio</seg> <seg xml:id="L146-01-01-04">iin</seg> <seg xml:id="L146-01-01-05">ñasuii</seg> <seg xml:id="L146-01-01-06">luu</seg><c>.</c></seg>
               <seg xml:id="L146-01-02"><seg xml:id="L146-01-02-01">Saan</seg> <seg xml:id="L146-01-02-02">so</seg> <seg xml:id="L146-01-02-03">ta'an</seg> <seg xml:id="L146-01-02-04">ta</seg> <seg xml:id="L146-01-02-05">inia</seg> <seg xml:id="L146-01-02-06">ka'nia</seg> <seg xml:id="L146-01-02-07">saa</seg><c>.</c></seg>
               <seg xml:id="L146-01-03"><seg xml:id="L146-01-03-01">Ta</seg> <seg xml:id="L146-01-03-02">ntsinia</seg> <seg xml:id="L146-01-03-03">iin</seg> <seg xml:id="L146-01-03-04">saa</seg> <seg xml:id="L146-01-03-05">ra</seg> <seg xml:id="L146-01-03-06">ntanai</seg> <seg xml:id="L146-01-03-07">ti</seg> <seg xml:id="L146-01-03-08">kua'in</seg> <seg xml:id="L146-01-03-09">saan</seg> <seg xml:id="L146-01-03-10">ni</seg><c>.</c></seg>
               <seg xml:id="L146-01-04"><seg xml:id="L146-01-04-01">Sara</seg> <seg xml:id="L146-01-04-02">iin</seg> <seg xml:id="L146-01-04-03">kii</seg> <seg xml:id="L146-01-04-04">ra</seg> <seg xml:id="L146-01-04-05">ntsinia</seg> <seg xml:id="L146-01-04-06">iin</seg> <seg xml:id="L146-01-04-07">saa</seg> <seg xml:id="L146-01-04-08">luu</seg> <seg xml:id="L146-01-04-09">vii</seg> <seg xml:id="L146-01-04-10">ta</seg> <seg xml:id="L146-01-04-11">tumi</seg> <seg xml:id="L146-01-04-12">ti</seg><c>,</c> <seg xml:id="L146-01-04-13">ra</seg> <seg xml:id="L146-01-04-14">ntantuki</seg> <seg xml:id="L146-01-04-15">uli</seg> <seg xml:id="L146-01-04-16">ña</seg> <seg xml:id="L146-01-04-17">ra</seg> <seg xml:id="L146-01-04-18">nikitsai</seg> <seg xml:id="L146-01-04-19">ki'nia</seg> <seg xml:id="L146-01-04-20">ti</seg><c>.</c></seg>
            </p>
         </div>
         <div xml:id="L146-02">
            <head><graphic url="L146-02.png"/></head>
            <p>
               <seg xml:id="L146-02-01"><seg xml:id="L146-02-01-01">Sara</seg> <seg xml:id="L146-02-01-02">saa</seg> <seg xml:id="L146-02-01-03">luu</seg> <seg xml:id="L146-02-01-04">ka</seg> <seg xml:id="L146-02-01-05">ntava</seg> <seg xml:id="L146-02-01-06">ti</seg><c>,</c> <seg xml:id="L146-02-01-07">ntsikuntu'u</seg> <seg xml:id="L146-02-01-08">ti</seg> <seg xml:id="L146-02-01-09">nta'a</seg> <seg xml:id="L146-02-01-10">iin</seg> <seg xml:id="L146-02-01-11">yutu</seg> <seg xml:id="L146-02-01-12">ra</seg> <seg xml:id="L146-02-01-13">ntsa'an</seg> <seg xml:id="L146-02-01-14">tuku</seg> <seg xml:id="L146-02-01-15">ñasuii</seg> <seg xml:id="L146-02-01-16">luu</seg> <seg xml:id="L146-02-01-17">ka</seg><c>,</c> <seg xml:id="L146-02-01-18">niki'ni</seg> <seg xml:id="L146-02-01-19">tuki</seg> <seg xml:id="L146-02-01-20">ti</seg> <seg xml:id="L146-02-01-21">ra</seg> <seg xml:id="L146-02-01-22">niyu'u</seg> <seg xml:id="L146-02-01-23">saa</seg> <seg xml:id="L146-02-01-24">luu</seg> <seg xml:id="L146-02-01-25">ka</seg><c>.</c></seg>
            </p>
         </div>
         <div xml:id="L146-03">
            <head><graphic url="L146-03.png"/></head>
            <p>
               <seg xml:id="L146-03-01"><seg xml:id="L146-03-01-01">Sara</seg> <seg xml:id="L146-03-01-02">ntava</seg> <seg xml:id="L146-03-01-03">ti</seg> <seg xml:id="L146-03-01-04">saan</seg> <seg xml:id="L146-03-01-05">ni</seg> <seg xml:id="L146-03-01-06">mancha</seg> <seg xml:id="L146-03-01-07">yuku</seg> <seg xml:id="L146-03-01-08">sara</seg> <seg xml:id="L146-03-01-09">ntanaa</seg> <seg xml:id="L146-03-01-10">tuku</seg> <seg xml:id="L146-03-01-11">ñasuii</seg> <seg xml:id="L146-03-01-12">luu</seg> <seg xml:id="L146-03-01-13">ka</seg> <seg xml:id="L146-03-01-14">saan</seg> <seg xml:id="L146-03-01-15">ni</seg><c>.</c></seg>
            </p>
         </div>
         <div xml:id="L146-04">
            <head><graphic url="L146-04.png"/></head>
            <p>
               <seg xml:id="L146-04-01"><seg xml:id="L146-04-01-01">Sara</seg> <seg xml:id="L146-04-01-02">kunkuaa</seg> <seg xml:id="L146-04-01-03">ra</seg> <seg xml:id="L146-04-01-04">saa</seg> <seg xml:id="L146-04-01-05">luu</seg> <seg xml:id="L146-04-01-06">ka</seg> <seg xml:id="L146-04-01-07">ntava</seg> <seg xml:id="L146-04-01-08">ti</seg> <seg xml:id="L146-04-01-09">kua'an</seg> <seg xml:id="L146-04-01-10">ti</seg><c>.</c></seg>
               <seg xml:id="L146-04-02"><seg xml:id="L146-04-02-01">Sara</seg> <seg xml:id="L146-04-02-02">ñasuii</seg> <seg xml:id="L146-04-02-03">luu</seg> <seg xml:id="L146-04-02-04">ka</seg> <seg xml:id="L146-04-02-05">nikintoi</seg> <seg xml:id="L146-04-02-06">ika</seg> <seg xml:id="L146-04-02-07">ñantu'i</seg> <seg xml:id="L146-04-02-08">nta'ia</seg> <seg xml:id="L146-04-02-09">vari</seg> <seg xml:id="L146-04-02-10">nikuaa</seg><c>.</c></seg>
            </p>
         </div>
         <div xml:id="L146-05">
            <head><graphic url="L146-05.png"/></head>
            <p>
               <seg xml:id="L146-05-01"><seg xml:id="L146-05-01-01">Sara</seg> <seg xml:id="L146-05-01-02">ta</seg> <seg xml:id="L146-05-01-03">ntuu</seg> <seg xml:id="L146-05-01-04">kii</seg> <seg xml:id="L146-05-01-05">xchaan</seg> <seg xml:id="L146-05-01-06">nta'i</seg> <seg xml:id="L146-05-01-07">iin</seg> <seg xml:id="L146-05-01-08">ñasuii</seg> <seg xml:id="L146-05-01-09">luu</seg> <seg xml:id="L146-05-01-10">ka</seg> <seg xml:id="L146-05-01-11">ñantu'i</seg><c>.</c></seg>
               <seg xml:id="L146-05-02"><seg xml:id="L146-05-02-01">Sara</seg> <seg xml:id="L146-05-02-02">iin</seg> <seg xml:id="L146-05-02-03">chaa</seg> <seg xml:id="L146-05-02-04">ntuku</seg> <seg xml:id="L146-05-02-05">tutu</seg> <seg xml:id="L146-05-02-06">ntsini</seg> <seg xml:id="L146-05-02-07">so'o</seg> <seg xml:id="L146-05-02-08">ra</seg> <seg xml:id="L146-05-02-09">ña</seg> <seg xml:id="L146-05-02-10">nta'ia</seg> <seg xml:id="L146-05-02-11">ra</seg> <seg xml:id="L146-05-02-12">ntsa'an</seg> <seg xml:id="L146-05-02-13">ra</seg> <seg xml:id="L146-05-02-14">ntsanche'e</seg> <seg xml:id="L146-05-02-15">ra</seg> <seg xml:id="L146-05-02-16">nuu</seg> <seg xml:id="L146-05-02-17">ñantu'i</seg><c>.</c></seg>
            </p>
         </div>
         <div xml:id="L146-06">
            <head><graphic url="L146-06.png"/></head>
            <p>
               <seg xml:id="L146-06-01"><seg xml:id="L146-06-01-01">Sara</seg> <seg xml:id="L146-06-01-02">nikachi</seg> <seg xml:id="L146-06-01-03">ra</seg>:</seg>
               <seg xml:id="L146-06-02"><c>—</c><c>¿</c><seg xml:id="L146-06-02-01">Nchii</seg> <seg xml:id="L146-06-02-02">sa'u</seg> <seg xml:id="L146-06-02-03">ña</seg> <seg xml:id="L146-06-02-04">nta'iu</seg><c>?</c></seg>
               <seg xml:id="L146-06-03"><seg xml:id="L146-06-03-01">Sara</seg> <seg xml:id="L146-06-03-02">nikachi</seg> <seg xml:id="L146-06-03-03">ñasuii</seg> <seg xml:id="L146-06-03-04">luu</seg> <seg xml:id="L146-06-03-05">ka</seg>:</seg>
               <seg xml:id="L146-06-04"><c>—</c><seg xml:id="L146-06-04-01">Nta'i</seg> <seg xml:id="L146-06-04-02">yu</seg> <seg xml:id="L146-06-04-03">vari</seg> <seg xml:id="L146-06-04-04">niyu'u</seg> <seg xml:id="L146-06-04-05">ta</seg> <seg xml:id="L146-06-04-06">yu</seg><c>.</c></seg>
               <seg xml:id="L146-06-05"><seg xml:id="L146-06-05-01">Sara</seg> <seg xml:id="L146-06-05-02">nikachi</seg> <seg xml:id="L146-06-05-03">ñachaa</seg> <seg xml:id="L146-06-05-04">ka</seg>:</seg>
               <seg xml:id="L146-06-06"><c>—</c><seg xml:id="L146-06-06-01">Ma</seg> <seg xml:id="L146-06-06-02">nta'iu</seg><c>,</c> <seg xml:id="L146-06-06-03">ra</seg> <seg xml:id="L146-06-06-04">michu'mi</seg> <seg xml:id="L146-06-06-05">naka'ncha</seg> <seg xml:id="L146-06-06-06">yu</seg> <seg xml:id="L146-06-06-07">tutu</seg><c>,</c> <seg xml:id="L146-06-06-08">ra</seg> <seg xml:id="L146-06-06-09">nano'o</seg><c>,</c> <seg xml:id="L146-06-06-10">naku'un</seg> <seg xml:id="L146-06-06-11">yu</seg> <seg xml:id="L146-06-06-12">kuntasya'a</seg> <seg xml:id="L146-06-06-13">yo</seg> <seg xml:id="L146-06-06-14">ve'u</seg><c>,</c> <seg xml:id="L146-06-06-15">ra</seg> <seg xml:id="L146-06-06-16">miki</seg> <seg xml:id="L146-06-06-17">ka</seg> <seg xml:id="L146-06-06-18">kaku</seg> <seg xml:id="L146-06-06-19">ka'nu</seg> <seg xml:id="L146-06-06-20">saa</seg><c>.</c></seg>
            </p>
         </div>
         <div xml:id="L146-07">
            <head><graphic url="L146-07.png"/></head>
            <p>
               <seg xml:id="L146-07-01"><seg xml:id="L146-07-01-01">Sara</seg> <seg xml:id="L146-07-01-02">ñasuii</seg> <seg xml:id="L146-07-01-03">luu</seg> <seg xml:id="L146-07-01-04">ka</seg> <seg xml:id="L146-07-01-05">niyu'u</seg> <seg xml:id="L146-07-01-06">xiin</seg> <seg xml:id="L146-07-01-07">ra</seg> <seg xml:id="L146-07-01-08">miki</seg> <seg xml:id="L146-07-01-09">ka</seg> <seg xml:id="L146-07-01-10">nitsa'in</seg> <seg xml:id="L146-07-01-11">ka'nia</seg> <seg xml:id="L146-07-01-12">saa</seg><c>.</c></seg>
            </p>
         </div>
      </body>
   </text>
</TEI>
Ña niyu'u nchu'a El niño que se asustó mucho

This is an XML-TEI markup of the original publications by SIL Mexico for the purposes of: archival and for the creation of a body of Mixtepec-Mixtec language resources (LR) (ISO: mix). Encoded documents used with the permission of original authors.

Este cuento relata cómo a un niño se le quitaron las ganas de matar pájaros. María Gómez Hernández Gisela Beckmann María M. Nieves Instituto Lingüístico de Verano, A.C. Tlalpan, D.F., México second 2008 Obtained from:http://www-01.sil.org/mexico/mixteca/mixtepec/L146-NinoAsusto-mix.htm

In this document (and in all others in the SIL collection) the 'seg' element is used to encapsulate: i) the entire sentences from original document (first level); ii) the individual lexical items contained within the sentences (second level); Since the segments are not all orthographic/prose in nature, as many contain mathematical word problems, or completely numerical equations (some contain a mixture of both), the segments in this document that are not entirely in prose are given the xml-tei attributes: Within the sentence level "seg" (segment) elements all punctuation characters (".", ";", ":", "¿", "?", "!", etc.) are encapsulated in the "c" (character) element; In future stages of the project, each of these punctuation characters will be formally linked with the linguistic categories with which it is associated: a sentence that begins with: "¿" (and ends with: "?") can be immediately recognized as sentence type "question/interrogative"; Where the page divisions in the original documents are, this encoding uses "div" (division) elements; this is to separate the linguistic, and knowledge data from formatting

PEDAGOGICAL PEDAGOGICAL:INTERACTIVE PEDAGOGICAL:REFERENCE FICTION FICTION:FANTASY FICTION:REALISTIC FOLKLORE

This portion of the lexicography project deals with the only known body of literature in Mixtepec-Mixtec language which is comprised of series of 34* orthographic children's texts published by SIL Mexico (Summer Institute of Linguistics). The target audience of these documents is primarily young MIX speakers and are designed for use as classroom handouts and/or lessons for primary/elementary school-level. The topical content of these publications contain both culturally specific, and non-culturally specific subject matter supplemented by illustrations. Topics covered in these documents include: vocabulary, mathematics, telling time, geography, seasons, weather, local agricultural practices, fables, local Mixtec legends amongst others. These publications make up the second most important collection in this project and they remain the only source of reference for examples of how the working orthography of the Mixtec language is applied to the Mixtepec variety. The encoding of this portion of the MIX Language resources can be divided into the following components: i) TEI-XML markup of documents that supports reuse and extension within this project and/or for other interested parties; ii) Annotation and glossing of lexical information and structures in documents; tasks include; glossing of orthographical lexical items; assignment of lexical properties to sentences, phrases and lexical items (morpho-/syntactic; pragmatic; semantic etc.); linking corresponding bilingual items (Spanish-Mixtec) within each document; entry of analyzed, glossed and categorized lexical information into TEI-Dictionary; link orthographical forms of lexical items with phonetic transcriptions of spoken forms; iii) Classification of Document Types and Ontology Linking; replace current SIL document classification model#tax.sil-mix with established and expansible document system which can be used to classify any new document or literature produced in the language in the future; as per the goal of the overall project, the use of language-specific glosses (including 'Interlingua') will be replaced by grounding the semantic profile of each lexical item by referencing ontological entities (from existing ontologies such as DBpedia, SKOS, etc.);

Iin kii ntsio iin ñasuii luu . Saan so ta'an ta inia ka'nia saa . Ta ntsinia iin saa ra ntanai ti kua'in saan ni . Sara iin kii ra ntsinia iin saa luu vii ta tumi ti , ra ntantuki uli ña ra nikitsai ki'nia ti .

Sara saa luu ka ntava ti , ntsikuntu'u ti nta'a iin yutu ra ntsa'an tuku ñasuii luu ka , niki'ni tuki ti ra niyu'u saa luu ka .

Sara ntava ti saan ni mancha yuku sara ntanaa tuku ñasuii luu ka saan ni .

Sara kunkuaa ra saa luu ka ntava ti kua'an ti . Sara ñasuii luu ka nikintoi ika ñantu'i nta'ia vari nikuaa .

Sara ta ntuu kii xchaan nta'i iin ñasuii luu ka ñantu'i . Sara iin chaa ntuku tutu ntsini so'o ra ña nta'ia ra ntsa'an ra ntsanche'e ra nuu ñantu'i .

Sara nikachi ra: ¿ Nchii sa'u ña nta'iu ? Sara nikachi ñasuii luu ka: Nta'i yu vari niyu'u ta yu . Sara nikachi ñachaa ka: Ma nta'iu , ra michu'mi naka'ncha yu tutu , ra nano'o , naku'un yu kuntasya'a yo ve'u , ra miki ka kaku ka'nu saa .

Sara ñasuii luu ka niyu'u xiin ra miki ka nitsa'in ka'nia saa .

Toolbox

Themes:

Ña niyu'u nchu'a El niño que se asustó mucho

This is an XML-TEI markup of the original publications by SIL Mexico for the purposes of: archival and for the creation of a body of Mixtepec-Mixtec language resources (LR) (ISO: mix). Encoded documents used with the permission of original authors.

Este cuento relata cómo a un niño se le quitaron las ganas de matar pájaros. María Gómez Hernández Gisela Beckmann María M. Nieves Instituto Lingüístico de Verano, A.C. Tlalpan, D.F., México second 2008 Obtained from:http://www-01.sil.org/mexico/mixteca/mixtepec/L146-NinoAsusto-mix.htm

In this document (and in all others in the SIL collection) the 'seg' element is used to encapsulate: i) the entire sentences from original document (first level); ii) the individual lexical items contained within the sentences (second level); Since the segments are not all orthographic/prose in nature, as many contain mathematical word problems, or completely numerical equations (some contain a mixture of both), the segments in this document that are not entirely in prose are given the xml-tei attributes: Within the sentence level "seg" (segment) elements all punctuation characters (".", ";", ":", "¿", "?", "!", etc.) are encapsulated in the "c" (character) element; In future stages of the project, each of these punctuation characters will be formally linked with the linguistic categories with which it is associated: a sentence that begins with: "¿" (and ends with: "?") can be immediately recognized as sentence type "question/interrogative"; Where the page divisions in the original documents are, this encoding uses "div" (division) elements; this is to separate the linguistic, and knowledge data from formatting

PEDAGOGICAL PEDAGOGICAL:INTERACTIVE PEDAGOGICAL:REFERENCE FICTION FICTION:FANTASY FICTION:REALISTIC FOLKLORE

This portion of the lexicography project deals with the only known body of literature in Mixtepec-Mixtec language which is comprised of series of 34* orthographic children's texts published by SIL Mexico (Summer Institute of Linguistics). The target audience of these documents is primarily young MIX speakers and are designed for use as classroom handouts and/or lessons for primary/elementary school-level. The topical content of these publications contain both culturally specific, and non-culturally specific subject matter supplemented by illustrations. Topics covered in these documents include: vocabulary, mathematics, telling time, geography, seasons, weather, local agricultural practices, fables, local Mixtec legends amongst others. These publications make up the second most important collection in this project and they remain the only source of reference for examples of how the working orthography of the Mixtec language is applied to the Mixtepec variety. The encoding of this portion of the MIX Language resources can be divided into the following components: i) TEI-XML markup of documents that supports reuse and extension within this project and/or for other interested parties; ii) Annotation and glossing of lexical information and structures in documents; tasks include; glossing of orthographical lexical items; assignment of lexical properties to sentences, phrases and lexical items (morpho-/syntactic; pragmatic; semantic etc.); linking corresponding bilingual items (Spanish-Mixtec) within each document; entry of analyzed, glossed and categorized lexical information into TEI-Dictionary; link orthographical forms of lexical items with phonetic transcriptions of spoken forms; iii) Classification of Document Types and Ontology Linking; replace current SIL document classification model with established and expansible document system which can be used to classify any new document or literature produced in the language in the future; as per the goal of the overall project, the use of language-specific glosses (including 'Interlingua') will be replaced by grounding the semantic profile of each lexical item by referencing ontological entities (from existing ontologies such as DBpedia, SKOS, etc.);

Iin kii ntsio iin ñasuii luu . Saan so ta'an ta inia ka'nia saa . Ta ntsinia iin saa ra ntanai ti kua'in saan ni . Sara iin kii ra ntsinia iin saa luu vii ta tumi ti , ra ntantuki uli ña ra nikitsai ki'nia ti .

Sara saa luu ka ntava ti , ntsikuntu'u ti nta'a iin yutu ra ntsa'an tuku ñasuii luu ka , niki'ni tuki ti ra niyu'u saa luu ka .

Sara ntava ti saan ni mancha yuku sara ntanaa tuku ñasuii luu ka saan ni .

Sara kunkuaa ra saa luu ka ntava ti kua'an ti . Sara ñasuii luu ka nikintoi ika ñantu'i nta'ia vari nikuaa .

Sara ta ntuu kii xchaan nta'i iin ñasuii luu ka ñantu'i . Sara iin chaa ntuku tutu ntsini so'o ra ña nta'ia ra ntsa'an ra ntsanche'e ra nuu ñantu'i .

Sara nikachi ra: ¿ Nchii sa'u ña nta'iu ? Sara nikachi ñasuii luu ka: Nta'i yu vari niyu'u ta yu . Sara nikachi ñachaa ka: Ma nta'iu , ra michu'mi naka'ncha yu tutu , ra nano'o , naku'un yu kuntasya'a yo ve'u , ra miki ka kaku ka'nu saa .

Sara ñasuii luu ka niyu'u xiin ra miki ka nitsa'in ka'nia saa .