* [gentoo-commits] repo/gentoo:master commit in: dev-python/nltk-data/
@ 2020-03-13 9:56 Michał Górny
From: Michał Górny @ 2020-03-13 9:56 UTC
To: gentoo-commits
commit: c2a377c1163cd9966cefe455ffaef3dd668fd202
Author: Michał Górny <mgorny <AT> gentoo <DOT> org>
AuthorDate: Thu Mar 12 13:35:24 2020 +0000
Commit: Michał Górny <mgorny <AT> gentoo <DOT> org>
CommitDate: Fri Mar 13 09:55:55 2020 +0000
URL: https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=c2a377c1
dev-python/nltk-data: New package for dev-python/nltk data files
Signed-off-by: Michał Górny <mgorny <AT> gentoo.org>
dev-python/nltk-data/Manifest | 107 ++++++++++++++
dev-python/nltk-data/metadata.xml | 14 ++
dev-python/nltk-data/nltk-data-20200312.ebuild | 184 +++++++++++++++++++++++++
3 files changed, 305 insertions(+)
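
Note on the Manifest format used in the diff that follows: each DIST entry
records a distfile name, its size in bytes, and then pairs of hash name and
hex digest (here BLAKE2B and SHA512). Portage performs this verification
itself when fetching, so the following is only a minimal Python sketch of
the idea. The helper names parse_dist_line and verify_distfile are
hypothetical and not part of Portage, and the sketch assumes the Manifest's
BLAKE2B is the 512-bit variant (hashlib's default), which matches the
128-hex-digit values in these entries.

```python
import hashlib


def parse_dist_line(line):
    """Split a Manifest DIST entry into (name, size, {algorithm: hexdigest})."""
    # Tolerate the leading '+' that a unified diff puts in front of added lines.
    fields = line.strip().lstrip("+").split()
    if len(fields) < 3 or fields[0] != "DIST":
        raise ValueError("not a DIST entry")
    name, size = fields[1], int(fields[2])
    # The remaining fields alternate: ALGORITHM hexdigest ALGORITHM hexdigest ...
    rest = fields[3:]
    if len(rest) % 2 != 0:
        raise ValueError("hash name without a digest")
    digests = dict(zip(rest[0::2], rest[1::2]))
    return name, size, digests


def verify_distfile(data, size, digests):
    """Return True if the bytes match the recorded size and every listed digest."""
    if len(data) != size:
        return False
    computed = {
        "BLAKE2B": hashlib.blake2b(data).hexdigest(),  # 64-byte digest by default
        "SHA512": hashlib.sha512(data).hexdigest(),
    }
    # A hash name this sketch does not know is treated as a failed check.
    return all(computed.get(algo) == value for algo, value in digests.items())
```

For the larger archives listed here (for example the framenet files), one
would feed the file to the hash objects in chunks rather than reading it
into memory in one piece.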
diff --git a/dev-python/nltk-data/Manifest b/dev-python/nltk-data/Manifest
new file mode 100644
index 00000000000..fa415ee7f30
--- /dev/null
+++ b/dev-python/nltk-data/Manifest
@@ -0,0 +1,107 @@
+DIST nltk-abc-20200312.zip 1487851 BLAKE2B ba28be6f7d5fcd1840af11bae1e669233a151f9d6a5ae1a958ab5b5b8bf33eaac03a5172f87bd0dd481b1cd069fa107d0fadd6f5cf037eee4350652a58e21bcc SHA512 e2ee1fc1738c1ebfac50271e9c8d8396a603ceef2b1a6471fb38ba78a1a74e639f5c51bf93ada93f70832462bd7bc42621d1c5ccfeba0315abe8408f28788db0
+DIST nltk-alpino-20200312.zip 2797255 BLAKE2B 54cff649a36e0eefd74b217ee5c4fa1be464edebb5b4bbf004960487fe172e6e623b5fb9b104da34e109eac3dbed02397119b7998ed8db8b32dfd62ab436f4bd SHA512 0f0b40ca8c36068dabde193299fc00754b42f0ef42f562b96ec413b8f6400db9fe874fa750b198ac92fa7eb3cb8e4bf1ecd463aa95005dd4fb23825e47c4a772
+DIST nltk-averaged_perceptron_tagger-20200312.zip 2526731 BLAKE2B bc6bfc544c78df029727c4f809749902539310b1894803f83b07add698455bbd9ca0f5fbd358ef91978414714b2041c757a1bfbd1db985ba92e81fbf7ae937a8 SHA512 bda565462823373905ed73073aaa78fc390077f370fdd942e7681e7e9b6f78d41bc4b71e7aa9b4d601de5a5a886d4bbb14b062e1e7761629980a508054f7facc
+DIST nltk-averaged_perceptron_tagger_ru-20200312.zip 8628828 BLAKE2B 026ca0e6a511098029b45efa46dd336d37123575bd77682f8a6084c39327097c85052818921a5b0ce2f202924ddf2a24418025f8c979bef034aa0086cb583a17 SHA512 fd56c7f8188c2ddd6b6239189ffa701c56f2e47376a210bb0c92881969304874f720f27a6965e853db2bf43aff202f09dfea7bb30420590e1a75940493797db6
+DIST nltk-basque_grammars-20200312.zip 4704 BLAKE2B 024c6d86844bb3ea368b6dd71d065b4bd38b551576b3919f26e6c32139ebb3b5338c17453d766cef4dc7f5c358220b2738d16baf58c8cf794954e10a460100fa SHA512 52d07166551dadbf99830052595982b93c0f8c611c4051efc00dc2054a4a219b8a3ee57072f8bac51b34e5eb8e1e5631fc2668bf319d252fca201d798a58455a
+DIST nltk-biocreative_ppi-20200312.zip 223566 BLAKE2B ca749a1713992045363a7e4af7c912a750f6a17cc729f60fe1ccbe62cbb565a2c349c09764e91946753e630409382b4b1a1578a17b29e10d7bdba9818c6ef098 SHA512 b6dd5eee7849c1161afca851d22106671ba27cef63d4bf7f500b6c92625b31cbf4aa7593ae65c7eac25f532ce93f06faeb78e7131e35b5cc0c2d74a2c06d43d7
+DIST nltk-bllip_wsj_no_aux-20200312.zip 24516205 BLAKE2B cb52ef99d7505b956b31ecf0c02ae5538946ef6b9b639eaf7b2808795a3bb6e05188da20b8495d80a4a3077a6e8f1d53b2ebbadf5d14382e245841ae1919a839 SHA512 03bb1c3b6c27e0049e7b5fc7721e2903c1dea80b55fe35208a4acd67116618a646d98fc5184d3a43ac9ce3dc693fb8ca768a18f20d9d6eb69d91fd2a101cc9e5
+DIST nltk-book_grammars-20200312.zip 9103 BLAKE2B 4087df6322534a8b012df25587151afe6121ad857ed99cdac1cfa5c8f9aca4d2c623876fb65fa21b9f391c0e96401ef18a0f403bce343755fff38c7d85292012 SHA512 8cfa085e8203f9c415d18c0afb3e3cf9963214929f4de71cf5933234e3623e432cfca3a167cd7c2d9cdd4d22496cf959249790789782675cec392d0e0ef5549d
+DIST nltk-brown-20200312.zip 3314357 BLAKE2B f9c54f01d08ce2da90238a94e74c693ca60ad63b0cfdc141897942b4c01235f505d9b60f4ab0eb0aa63fc4b414922f5d71eed404486d4b31788f167e068c300c SHA512 f425e022f8f3eb459cf9ad7c90c326920430f7f0ad3e15d5961dfb5146ac355f7e13ba7681a78a26d3c5b146979eb778012aac4ce3dc5d59421a60798dede85d
+DIST nltk-brown_tei-20200312.zip 8737738 BLAKE2B 5386fc3c818e8f772760f8c8288fab1fb6f67d5f27e8aa47bc9366716b66c1b4c5fb52e0c9d52f1d2884ab426fe45afcb5d07dc497cf1a2e86dc4c06dd498cac SHA512 f3dcc36bcab63d481e4d833c8946f10163f732166114c8fdd63932fff9fba3c236593a082ebcdf96f74aea6d33e424b7be4c645fd0f5ee5090f0335544c02c47
+DIST nltk-cess_cat-20200312.zip 5396688 BLAKE2B b2c79b6d938b98118ed0f541ed75509bd47d2309123c97aff2aa1f03fe6da298c5270a9cd6fe3da91e57402b6bbf3953c88eab7e25c47b9a12720ff0367499e1 SHA512 6f14beaa98a5f0731fe126c7e10110045c7cac8f7893bea0a6895bc64e6953622f989e93c2e77f6d66d5a3024acc11422584d6563fc3efa84766757e2c5d4127
+DIST nltk-cess_esp-20200312.zip 2220392 BLAKE2B 16695534a2267c20d204e6ce19a19fbf053d3f4e06d8b52bf480f9c775b05381a902f56b0793aabb20ad07ed6334c3f24217dcfffb33bf32bfb3aaa6736dcbf0 SHA512 daca83bbe8d725fad5775f5c71946ae534764a758ccf428b3f48674eaa4a5a8342cb35f38ae9715ffa75030771b91f1303bb2b98a55922337e6b39dfa2e7a94a
+DIST nltk-chat80-20200312.zip 19209 BLAKE2B a9dc4eb6d0a6d081cff840a8cd6b3249ca2e26a6aa783f33fba770f3c56917537cfb00706464767b70d4b10339b443615272b909fc43efe5ce1e8d8dbcb8df2e SHA512 2d590ac4a785132be92764dda5577f2d7e7209cc1d58302296707d9ba8dd312dc01cdb104183922de88ba72f7cc3cd4b8fd95b2b18fd41321a27f4b964e4a500
+DIST nltk-city_database-20200312.zip 1708 BLAKE2B 070a7d51fa2d39b891a52178a535c11ad7122b34fb1f847b87b86e84db85d140d8b2ca8b029375bb517d38a1c92afdd86f358068f6ea097873713037978a5df2 SHA512 ca87a07da6b1fa1ad786d43d2d7083797a4b81f033248d6951fe2162a3b4b35d20eb46d74dc9cb7c9eda202a8b6cfe8899a3b02cb768be9d1f0b4be1a290ec2e
+DIST nltk-cmudict-20200312.zip 896069 BLAKE2B bcef4718794c180f8641dedf2d85bd338d440c470715bd34eb843b2487cd2b2e746808f9420adebe0757987d09def5c5547e797f4cfd49b91ff3274538836211 SHA512 d3b5d3f9675e242487953ffea9b93e8a8f66462fb09321303064c618e2483e13456d7bbb307b2791926a1e843fdc1df0cb29cf185e2adb6fd08f6286295f42c8
+DIST nltk-comparative_sentences-20200312.zip 279121 BLAKE2B b7b12e9922a2688586aedde8a50e01c29387793955b45c87eee2ad43e41353cba7a449e5746f41eb1b239263274fb91b4a69bda2359b6abd90e46b38acc061eb SHA512 38b383a8cde09c1e8269f047b89c364177507c57a330c361792e92116743adec6ee25202ddf672af041798cc558b63e1e35478283cfb365ef33f8657a462e978
+DIST nltk-comtrans-20200312.zip 11904518 BLAKE2B ae26bec3290d83673d0d894d62c22fac36d0fa4fa03e3989c0b924de65fac12f496bd730fa7512737dbebd59bd48e6b621e4234b0e26ab840c672fab555fbcb8 SHA512 c631d2d21f5a997548ad979e116e9ac642808bea0faa07fa93c57e0d455abb151b9c7d6badbc0b4ff4390994cb7736e722217a9c3da1c8d51da94d6e85629664
+DIST nltk-conll2000-20200312.zip 756607 BLAKE2B 31b4254f9eeb6a3c728193cb89a6e5eff59100c108ac7b19707e86d04f314bb2362e4e6a4770632308a535121ff2e847d8cdbea52835097429fe2111b1e580ac SHA512 421eb51171614175c27bc82fe69e1a6d22b426777c5e62f850ecc288f348feee3cfdf29e8fae9f59c53d47684ba031e108d8a29c5b9c27fc0d2f0225ab83f4c6
+DIST nltk-conll2002-20200312.zip 1867449 BLAKE2B 127a22ebf3fc2ac99e0b24641f2da1d122c67cbfd25eb59bb81ea430f1a08ed74c64b54b87b6457a903a164d2fcd08d4cec6ff7b98943024bc10a4cf54fc729a SHA512 0b9e575ed09771ca4ae5b6f5acf66466def823d9daae20eba4ac291fd501f2b651e018874b3f91dfb42b998abe8c9e0fb1ff9f2ef883f462b29c01547ba0d4d4
+DIST nltk-conll2007-20200312.zip 1242958 BLAKE2B dc01ab90a2d82f2fbe0b4276dc8c139626193f696197629b9bdcb75ae5da14739b319a4455dca870ae968c3fb00473836c56fce6e02f9ef5cc86fc7e44261d72 SHA512 3be03a672c9a4b823ed8c7cd3b571a7c0120415ef293f0e3044bf967f69b6efd413ee57cfe27a1407ec6fa2aad8e625019ddfcf0eec7a48ef8d745064adefba2
+DIST nltk-crubadan-20200312.zip 5288655 BLAKE2B 2f5103c9801b965ce5d893b690dedc5f8524b1266e32a01251834c80cf291963657b3c1366ef6ed21f9f89c9ab4a696de72a752ce53b651d596ade99edaace2e SHA512 cbb6a8c53a89bb68238853da2409b3bf04e832ede977bee673ad966a74f4c732ee6ba844b8171de26a641fc24f2e9f6d4755adeef3524583f8765d3237057726
+DIST nltk-dependency_treebank-20200312.zip 457429 BLAKE2B a60a95f9bcbb72b4ee19a91c380eb620261687f23bd867ae4cf2d7937b60b8dacafb52c09d7bcbcb9fbbbda4e57954ee55a900d70e69e7e290e53c668dd0775f SHA512 b225ace2e2413f0b9ae0e142d81052e64a1bcacb71caf7b07a577c18dec45dc429116abfd12edd3bb1442a111f4fa6e3e33e2b5f113e26dfc4e917268d821736
+DIST nltk-dolch-20200312.zip 2116 BLAKE2B a79b5d5de2ab03be43e5fdb1b34c70a458af6efce0e35f42f3c4d748c84cd950c98981f969fea2b0507ee405438f84b50e526aa94a93f99de7567ccb5c462215 SHA512 29254c95d11fd2001f486a5c682f7ab7f23fa13472ac4142db3702e2fa994f74ffabc2f571f1feb94a2e65fcb8da97c57a519f24504379b0e369fd27f9f867f0
+DIST nltk-europarl_raw-20200312.zip 12594977 BLAKE2B 295e274880ae34c9618a70cabc1b1075e35c442158aabe437972caebbf36d09c4f5f5b66d8ae7ac953f4312f9dac8f5e0c6d3346a444afe98ac700fb3e622539 SHA512 47dcd5420df6ab294f75204355cf346257a821d491f65b856c41a77237df1d7aeb352ed1353da55bfb68fa9f0d8782b9aac9c33bd49af3b395cc780dc421e26d
+DIST nltk-floresta-20200312.zip 1882021 BLAKE2B 092886cb15c6215b4fe979ce62a1a137de97c4749c372426211d36117d367697d19ebe6a9337f702ee9dfc133318b1e7de51393c5a41afc31a960e10ebe7c498 SHA512 1a71453b9629d622c3468b1e00501683e0415996a62a30c498486354b82f0b3be76080dddc60117c8c9f29de3b34a2166a5de87376a0daeaf8b92bfe5dac668a
+DIST nltk-framenet_v15-20200312.zip 69337891 BLAKE2B 046087cb20013f3a597034458dd09b4cb7378b19f91f5ebf7196a92a02741a67b671f094262ac5d8e0e376743048c56dd1d65e6c37efb3b254bb3f8f6434addf SHA512 4b55a435f17afbb9cf5fb59691a03776784830cfa6f1f7b9c9c64770fa6dbb640bb3256a099328e44a0a6692ed52703b9f253e5229ef71ee300ace34027ba462
+DIST nltk-framenet_v17-20200312.zip 99207152 BLAKE2B b1a32c535d37fddba5c77922f3a6097859d8bf59a4bcf37dca4ff394fcf027fc68d781b3b6c0a8d5cd9a423a944687994ec6a2023dd45808dc697b4c90f28286 SHA512 3696c5bb0fdd59058a8723ba3ca962a0d6422b36d8dcfad0427caa21dae9579b85f2f9af431aa1fc1cb76e823676a996c5a26aca99782017270d03ef3bd77f0d
+DIST nltk-gazetteers-20200312.zip 8265 BLAKE2B aa4f8db99a7ae8c26911d33abb67713bc80ffe3d87b2b21f21360682327b51229c16e7bf2330ae44d998e4d9c9bdc0879000a07197891a70ef9788bf8f7aca80 SHA512 dee3a050b56bec85a97e85d867e404bbb4f5041bf4886f5a00eb3e4149db1ff6f8a5bfa67f0210c48539cbf0140aa2082e2d3e65b4919dc5d7a258dc9616257c
+DIST nltk-genesis-20200312.zip 473239 BLAKE2B 5234c2bcc31ae738bbd6cdfe1ee231a75ceacc74334e3b03fcefb409427406aee1700e0d8ca8fad1444736005b53fdb0fbdb9068085837f4bf7292cf61351b8b SHA512 e74f6a8f304bca292bf8e138a739a97633075cf9374c95f43ec75473cd7bbeb0921fe24f7864659e640e694d227e61debd67bc41150bd898b7241a274e28196f
+DIST nltk-gutenberg-20200312.zip 4251829 BLAKE2B 54448e919010f12faa92c0ffd4adb31a7d1d76c262d5ac99c10129d188830107621f90e4ba611ffc1541c25037fc916b451169a8ae1df276747890fea2b5e06f SHA512 7b4442d18b89f9bb8261f23ad2f4d513b470461255633cbebe2b0c310f003effe819f7a86b39a62a6cde1b3fdbf099dabbdaaed520268c99fd1bb933baa97ead
+DIST nltk-ieer-20200312.zip 166156 BLAKE2B 4c63be6f0f16e59584b7f97c21da0cf60a1efd3678ce662d683d8a6a9d05d333690b26ebb037d3047b26e3b51163ffa666f048ad8a89ee6c851ddd3edd2faf58 SHA512 b55545f0ac03282fb85c4a86bb69fd9988db8cc504114bff7b763007e20f559efa61b8ae5b7955c251c1c2dd9d23246e23515b1cd5d8d28225c20ec236c05cd8
+DIST nltk-inaugural-20200312.zip 329806 BLAKE2B 222e22742d93156b852ddd72ffccdd5be5c907f43973b0ceee8d8d383bc5387575d1f974b29fd6d59499ec86483f56270251e6a5e48d7840c577207849baed59 SHA512 096591769373019883b9a2f85d32bc86021aefb74deb26dc9005233ae7f317d587c247edc53f1a956b8e29911656419d9677c271fb59ded4ca4edf2a80ff86b7
+DIST nltk-indian-20200312.zip 199187 BLAKE2B 9f70f1f28bcb96d828ca2ed96d6f89cd8eca8391b64842c742e940aaf59f31e142a4919ab5a21bce954bbe2ed85add162ac67c9549758f9aa44b3e90d45d44a7 SHA512 08e9342567bcc607e9007aea8415af2cad1f8a289104b47a61bf569f1391629ebbe25f19aacf53e2b14725edfdf18076331f044e1c2e42d07c96d5c875ff8f9e
+DIST nltk-jeita-20200312.zip 16531215 BLAKE2B 7cf98b389d5c4b41d9d613c14c5116b83c520cfe7418ef982a6acc1294d8ecdb8a5d6d396409c9f13ca83df999793619d5eb5483612e87874766afe565c68362 SHA512 491e6dc27d70ebfd139be4dcd179b02c383f38edbe4290aed85638ecd7c987259b025e4f6f3f1d5bfe2b7aa3eff7188109205417d0acb64759988150f2753129
+DIST nltk-kimmo-20200312.zip 186958 BLAKE2B ca288f48c09a33b86449c780e5c8612c3521e22bfe70918629bf3144b75b5566a942c3cc1440a13c2d81e3a69c77928c336850cc9fac4c232deb77293bfbe4e6 SHA512 fe38d5298ffb6dd62223c2002278d9ddf2695f341a29477f44434644bc6766cfe2c73091067f9eb8aea201735eef5012f96ba267f2e6ea3aec41ac206a43cac2
+DIST nltk-knbc-20200312.zip 8760788 BLAKE2B edef7da714f81de69ca7f2d40d3c15dbe9e8ba30a044d72deb96572504ac1f539a3f51e75eceeabb196e3cc3ce4ec2756abd4e22833bc7cf9547b0805ae20784 SHA512 76f2be8b8841fad1a4ffe9b31711614d19090d1571f9d076686f6276908b8b2fa3c7258dc037f1ae7e4eee63aefc041203982bd9d2ea155c944bb713cb1f37e8
+DIST nltk-large_grammars-20200312.zip 283747 BLAKE2B dca821271941708d4ad44a2b0694a9471787ee4fac8ca3f98761404df3ee676f6e9bd529359a42292d00584ddd63658cd5ea4b455205c73f119cc4b15829c716 SHA512 86ac15613a918a924b6ffcfa4eca1491f76b4d3f92d9c3fed25257dbb0ebcf20fc69800f4bac07acb2915e3d75d41d309afb61d5354d71eef396ec3866e57f8a
+DIST nltk-lin_thesaurus-20200312.zip 89154019 BLAKE2B 7c039636911fbd96caaae6453f009fdb721efa7ae7b697c61279130f17cae5b6d96a8c8c12506212335925bd1ec7e58e609b72236cc0585bd26a8d144e4382ab SHA512 7972391164c7fcc90be80f4b673551cfbbaaf78653331023119a4296d6434c5a123dbdd4e631efc732531d1a5d0eb7ed1c9b46a2680d6493f3cb2bcf5c6f6d45
+DIST nltk-mac_morpho-20200312.zip 3013904 BLAKE2B a4a7e59cadf913179dd62a31c7e666e9dc134812405c3cdd78de2e772e457ff31d42b98e4f224756d921c584fd13c3708482007d575386bd21201c38492ab4a5 SHA512 cb892e0f414d12848d62012409cc1f19ce4c42490dd9ebb233110f75029670b1d413d82602a3b6cfbcc17fce448d7d5c2d1a811059280e154406eaa0e1b5c4a5
+DIST nltk-machado-20200312.zip 6151774 BLAKE2B 14642a214ec7e10b6451d1077e814dcfb2a5eb5f728ab368e5777d047c445b8d3e9f8808bfe9230107497a42fb87ad5c30295893352688923ee7efd456539fb3 SHA512 8ef0b5e768edd4a8b6c6831a6a4a1913ce82a57fc63f08cc29ad8442c49bd4eee27930c674f2ac141fb1f58eebde5486d12e6aa41681cb601ae9ef9562843a69
+DIST nltk-masc_tagged-20200312.zip 1602143 BLAKE2B 502d648d3af1689d81fa9e8f05f6c0dac51781f378e7cca0199a203eb98fbd61b038b075114e69f612b699c18a8353c5ab8314816422d402f84f9fd700004763 SHA512 558ee5fc652ca736f0f0238854e75d5d6e7f85e03bd19274b09bca616aaa8edd513db972f89c13232e616945cc3463fb4bfda96c2b3d897448d3b5d9ce370f20
+DIST nltk-maxent_ne_chunker-20200312.zip 13404747 BLAKE2B ef15eac5306f04adf23cce1810e80a5566b07b96616a27faf30ef6cde055eedecd82cb2b7d3f0ab8227e1e0d31fe998f0f4a93ed3ea80ba7d7f9d802fbee130c SHA512 2ac628c6a1d68a262031cb9dd0b3544da182cfe239a21996c7b7ccb540ce92b9fecb7a7f038ffdaa5c5e3a4d8b7194c7d699fb268004ab3bdb6cd671409c15d7
+DIST nltk-maxent_treebank_pos_tagger-20200312.zip 10156853 BLAKE2B ca58a0ca3e280d4acf77fc15102ab52d5fbbf2d83a6dfb91228eb7ca3585f8b1e567847c037221b3cf1650d086c1de2fcef3098a3859603f58a2cf24a6b77719 SHA512 ebcda9d7f843e47fb560530b9b75b44ae97e92102d341453202bdc42aa2e996c76833285134ca5c213d7af5b420065b28170cade92a67ba9a667b1b987b8331d
+DIST nltk-moses_sample-20200312.zip 10961490 BLAKE2B 29d4a055788f7fd87b2969a1396eb655dec29ff206e705c31bbfbecd88ab18e4dbfd7409f5c21cd5a53e3461c183628fe689634ccdbb8f24cdeef56bba230730 SHA512 54abd8add66ab286f3e016aa3fb7e74be0d4062117f382b13dc34b0091e823144f203e6c9b2c8ccaed6ffdd23e9667fee4aef57c7c95d7a3f4703eff7ee082a6
+DIST nltk-movie_reviews-20200312.zip 4004848 BLAKE2B fcdc45d77db5c89ec96ee327487e3ee17b89110c8b83f188bac868f476425e732a734e667ae3fab84cd0a2e7fdd49baf33304ffb443d17a3ebf3505a54c8e9d4 SHA512 d1271c8bc2585220418eae6f5b12137bee874ad6097875615bc254fad6a4f33e3a11878c41666fef4d16254d8489784fd3e296378ea1967acf73ba494140a1ab
+DIST nltk-mte_teip5-20200312.zip 14800561 BLAKE2B 32de06f0cc6111a4ccc1cbc99070e17018eafa45ebde6f99f8027beaacffbde1db7db9d6d2ad1044f431c52297d1809ed76a3b78543506c620bbddce88c8ea41 SHA512 10ffd9801bc665c4a3af47c0591a04144ead556cbc80604218d97f334139513c7a23ce90c76b2bfe873b16f94bdee5cae00bd99ae9da8c7d0e1e21aabfe99d74
+DIST nltk-mwa_ppdb-20200312.zip 1594711 BLAKE2B 2d9c4820da713072269a096c6c3414195888cfc4341a2d7a30e6be89efe0d9793e1cf830bf6fb396aac3da9236667c1246fad7c6d474810291e5addd250cbcd8 SHA512 4fbc2ea7c211298847c5fdfb187ea3d08fe385611fbc51d1a21efbf77c45ca886e4765007204ed03b03ee6431804d8cb4239416e95b7eea20f2f612549a60727
+DIST nltk-names-20200312.zip 21326 BLAKE2B 37389f44337724fac26eaad5a4c310ea244b471877dabcacbb05e927e486f28b696bb7898809c8f4e739566ad09bc2f5d5103e0f5c77fcf280127328e2709fa2 SHA512 72513955d69d57787ff6d74b563c53bef7e350acd7dbe1eeef0a00253a9862663d59588551d5deeb02b88fb691d72b630a65315f513b11dd6d683af5f2e4a6ea
+DIST nltk-nombank.1.0-20200312.zip 6728397 BLAKE2B 4dfb61fb651817086ad20957d02a39c99dff83ec45dc1a01e1dac055e0d7d9ef8108ef8df5218ce57b2b0866b748f5f76663a58e08f5e1731272173888e714e9 SHA512 81bcf29898ba1205ccbc8486d14518e95b0189b97b172a54580d4cb4584f4ef32a5284fe0abed0b954e67a318de8f10de5df1a20e6c08976df09cd2c9c66cb32
+DIST nltk-nonbreaking_prefixes-20200312.zip 25437 BLAKE2B 9f26efbcc429f2d36a5584014604d2ca9d4de3864aa6192a114e9e2cd55ff1df50d4321a8c40dbe96225fad6be51f7be66f8a4d58146e370a1d33a9305f7f95f SHA512 4cddfdacf1c7fe4e93a31333413a83ebea2ea7907f4e6fe99e1c6ff4aed8efc951f4399478cdb88e5d5dc01cfb734b6672f4007b2a17c4784896fecaf4a93cdf
+DIST nltk-nps_chat-20200312.zip 301366 BLAKE2B eadf0d0ee54c982765a124321fe6de161bfbfbfb0385d5d7e171666945ef313b960ba7b820e4fed02885ffc213173e2a7c97822513ae6e81e68858af21d9f424 SHA512 a716f3aaba3aedbbdaeaff1c6f1ef607476aa989eaef1c2971b827215f3167952477b203f1543d0fd27e5594c3efa01349c6a0efc1ac423048808f18eae94643
+DIST nltk-omw-20200312.zip 12110409 BLAKE2B 5ae634fdd89fcd2be2dea1511b2c434755e16b92804ea6bb5e2139113c644885d142e6290ab1d00c418f9df5241ebf6b7e227da120dc95d1e907d8f62a3d0cb8 SHA512 05b1b014e55738c49b34ea0371b95bd1d1aadfb0cc57325ec110ffbbb1250690e3a4aab45e877eb5433de08c7afe3ffa161e29976a0e28db9803fb637b24c843
+DIST nltk-opinion_lexicon-20200312.zip 24947 BLAKE2B 47f5a381647075f4bea0230939098a6e83b47860165595e72f076a5436e2aad78148606d86ec331b1cc323963cb2a7500669c6fc2aff2e889bbaec8fd8571722 SHA512 75b98afa24c4e0284b2858770d7655c5c399066f56c1b9aa03dd51d80f211a694856695f8192570f67f02f0827ab02c3f3b65feb5a56c04bebfae3b58ca56ba1
+DIST nltk-panlex_swadesh-20200312.zip 2861668 BLAKE2B e9ee3b2296c8738319a86dfb39d34fc5bcd346e725e0ad0df74571106ef05b71126aff70914e914cff436674c0d70c71ca36121331ab0327932505a2bdb26be1 SHA512 d096be1c6ea54814e448f61048f05df9107c0ca5f75eb3051a009efd4643c94188dde03505b0a4ae29d301268ffc5153eeee7012aae18310f9288fd4ccdb2287
+DIST nltk-paradigms-20200312.zip 24902 BLAKE2B 73b0e811f80b0dc64ae62f8a0ccb3906b723badd9b1ffd358194f7ea54373ecff4b75dcd4f06e3382793ea7bd31af98c6d31ebac5c793dd1446153e6d4371b13 SHA512 56ee67f12d2e794e3498183ee8cd90e7d691ab06f576ec413580fe1fa7ff49cbec01fc44c5b323fcd764600946a1866338c0babd6eb4cd658e006e4420205fba
+DIST nltk-pe08-20200312.zip 80735 BLAKE2B eff34d07cd6717e1e661ffb8b187f96f84b88fc53fec2d3c15c1591d3117c461a996cb1b9c4890c41f2d82e1a8253c3583149ef34c8c42858855ebac5659923a SHA512 c0d5fd7df03b0d3e69a45e4f3463cd3a9de76a4d66c39c0b0641ede9ddab34bae1df14d85c85c78b4d375b478b4a4cbefdbc5100aff6c4e59e550903537d3416
+DIST nltk-perluniprops-20200312.zip 100266 BLAKE2B 36459b87d3cffed0e1331c322b0eece674d2cf7a9ec9a8bf5ebebd6bb11dd417fdb0ce471bd7078a24e02b5477d162885a327fcb0b89ec5e845f81c58e74544a SHA512 3e18cb45063d3e2ea0dd5160d85efd672a1a4f94436f732470ef63c8dcb2b3faf92db38d2d4e9363759140a9380d212809cbcca0f4459fc35bbe9a77fa107b42
+DIST nltk-pil-20200312.zip 1510205 BLAKE2B 389a0df127ac521de7de262d7400252a7353b8842e5b27f948cfdddb7bf565702e08ad9341f0be6e0c7725882cda5a3db960c258d0ee42eb8023729291bf0640 SHA512 1964ed66f604e6c39c9cdc8a5a4d9603f00893d02129349d84cf4cce68034c04889e9346df7719d6f468c03ddbec4a892035f3330cb408a7f6aab0bfcb3b865d
+DIST nltk-pl196x-20200312.zip 7051453 BLAKE2B 2f5cb39fcea51d27992f5dd410dee8706c0c8146d216e4353f4e235a2df1833915ddc87cd20d78b9d7a503c10a6444842bf7d64fa6dbbb5bb1f28077b524f345 SHA512 2fbb2eec61562422ddbb7fad804c450a9b513dc10597c3341b811a3aa64a28ccaa836d736ac79319edba73c8c8eb85d4103573cfae51d649cfcbdd3a758e018d
+DIST nltk-porter_test-20200312.zip 200510 BLAKE2B dee1fda2cc678a4cdefbf2cf3084b1c8df08bdb195d2aa42fd3314870d5dd96f8d8219987b3285f9b040f3e493a97f0f306b0ee8309fb73a44ee25aa24fa97b2 SHA512 8be4d1faad2b64bb3b9bfb2831c5b870d4213929fd087a9c62e217b85df9f1c1946c9cbdefcc51c110b4980f44bccc45a61c683cfc951b0913270fc11517bee6
+DIST nltk-ppattach-20200312.zip 781714 BLAKE2B 26613312b15d391883679d26e528949cd8b50ed88b45776f87c40180f12a4a79dcd721a7d9849b97d6b86d5ddff871d29db674ae750b0dec94f042a78ab67458 SHA512 9afa462379522276c8cbb8e8dc14a57cd221272da40056a1ea82daaee44a8464283927a9fcd94b426257bc3a04374db058cfc4fa31435cf8993942d05444befd
+DIST nltk-problem_reports-20200312.zip 1032942 BLAKE2B 45b4536712c6eaa0c4b385792a22608737b3790c0e3b644695ac18b3327c216407d56c4b3a89c0d9123eaad4957104421b0c218822ab9f96000e9c0f422f332a SHA512 766c45a016171c7440ed2604db11f1ab1cf39d287bf9d10c87bc29fa76b0661a048fcc484328427c39d59556efec928a5fc687ba2d36e555c75826a28df2372a
+DIST nltk-product_reviews_1-20200312.zip 141287 BLAKE2B 18e83e267d3edadd7eb99dd87f9094b0fb426006e21497d373c190d17859dc50adc6dcac7101e2e0621df06c812287a7f969ea9c4149521b649ee7683c8c92b2 SHA512 511a719b2694bee35a39057a770005f0234beebc289e01aceddd432adabc5e580e99cc8a06c960855764709432b24b62508daa5414cd4b1e491379c3f07649de
+DIST nltk-product_reviews_2-20200312.zip 170698 BLAKE2B 395d5ac71b759a546a3e25fb6ef584eef5f6076fde9e3952f12a3b094360e493cf306fd07a3d266f46ceb37d94f000faf387b99ed7b4b9dec5cfa661b74d8318 SHA512 24ddd6e62a62ef81ec88fa5991507f3e7e479caf778d3738bb30ddc74898e03a8629a532147a0e0b899b3b2e0933fbefdd0ec0b0e774fc726c4145d66bf5cd29
+DIST nltk-propbank-20200312.zip 5323498 BLAKE2B 7abde7caca8fa267bdcfdacc507ecef133d80989909563380f9ac296213ddd94680c25b0e9eb339d2ed7f851eda185966f6be63ed86fae5d9d397d8758ec75bf SHA512 6bb6f88648c1f6044d30dbc56ce1924240ddc1420c7abe8ebecf3684d69bc2dc20f1c575a6ae368a39fff40c5ef09f80d94bf317a37301639321085f102b54bf
+DIST nltk-pros_cons-20200312.zip 746276 BLAKE2B 7a1aec593396f25c75949bde9627b0568e4d406fb2d42ccb6e7be6ff7d3b118bdfbf34b574e40cdb27d8e65e5a370f820c839bfbc0140b2b177c9dbdd89e3516 SHA512 9eea1070b67e648d5e6e84d5a336d95c9eb3192cd0fa4a66f298076ac5907783ecda5cfe1b6e66209a99ec4e326666b14e1cc89ffb4cc8b2b91fed7db77db75f
+DIST nltk-ptb-20200312.zip 6289 BLAKE2B cc4c91569a5a9bad0f3cfce8e0d98179a53b3caddfb66ece085b293b882e46328a108c3be86261dfd869ea2fcc33d3687bc41adac3ada88ea748cd3480c88836 SHA512 dd59d84f20ec52c38063f0961954c063f614b94696307a63fae21839c51c81435d0205fa276f6811d9da7fd083e93f51f0603b5c35dcddc539a9bf59b9dffb6e
+DIST nltk-punkt-20200312.zip 13707633 BLAKE2B 22a0b3aec8a72735fb39aeeb596601452a70cd69e91206e94c5204153da8dedea3b7ec00c460fe19962e223adfd9cffc05a00daa4df48e3f2da500da51e99d2f SHA512 d477823c0cab57a374822cf58dd8bad35b86a504a51dda10de33b27cfd2ca4c89bba946c09f1ed3b28ab4d1cc1e5f3d3781c5f78ebe87458e1df9d48c8841782
+DIST nltk-qc-20200312.zip 125456 BLAKE2B ae02a2cd727e1832ede5fa5b3ef561059c9c714ced80bca311a360dacec6e176c034d802838335437cb8fa25ea877c3774fbac6790cdf6b9b75a3908e76e2dcd SHA512 8cca288f51164f7a18e1c99f2b6a35a3d890bcc5471b5be26883dd87d9b5e06214d15cc1133598fee330f0daf74ceaaa862410f6d95dfd56842a2cb84dd67390
+DIST nltk-reuters-20200312.zip 6378691 BLAKE2B adcb783246f4fd6e73b32fa9419570d9c7e9ecff1c9e2ae39a587f9036ab5eaf9a6ed87b7794dfe2f0ba965cffca68e51c243e3396c179a0282555601caee55f SHA512 21b5a4e114137cd8a52b6e3567c2d72dbe7eac8a3234952896126d9d598fbfa72a7c1e6b7c424b8b016ed345d7350cc562c6d039e2fd6ec0d73c68e13e58605e
+DIST nltk-rslp-20200312.zip 3805 BLAKE2B bf4500dd0e28cb4f018b8d13c5b920294163e7a9735dda12935f6b25ce5c71f2181e3bcb33444d490ba7bb1a029ef50dcdb2df4bddb09bfaf9d91b91531b8461 SHA512 83994b9cf2c925e559c68a03abb1499e3cd308fdb5ac6e698c3b5e2458393d9eae2420b02b4c46fd04527913133a1f19b3a2e5312f07e222e8cf072886f94973
+DIST nltk-rte-20200312.zip 386303 BLAKE2B 384e22acb3ab9e37aeed26e85253a19bf14a4c0fff1030db5c530faa12380d7c8334278514e2e3a25ed6169f7310f960a116b10e1d021f5521f1271f2c039a05 SHA512 a091b04f9c63e4ee193f40cff7a60905a9f5bfbc687a27c7d66cc6f6896e5abd489dbb820f98baaa2a4d855e09b92203c3756d6eec31ed4f4c9dab9e689438d2
+DIST nltk-sample_grammars-20200312.zip 20293 BLAKE2B e59884ec6ed4e7699a7d066f332d09ca26e3abf53a801b37e8592c79f2a812a0fdb17d4f730ef5d820def8b78578c4ef8e60519e74a352f731884f91eccae75a SHA512 c4a99d38f6a85c6639d50b6fe9d0ce14bc87eb9dcb4a1a2dbc6756d756f903d6541d352401a42b20eb4a0df0ad1f41d75d33915b080345a9f88f4ad15f146b34
+DIST nltk-semcor-20200312.zip 4397021 BLAKE2B ef1a1915309bc4b829a6eae9cea0ce4a0a9e5caef731dfc107e634a6e6c2fc68173715d66b10d71931e4e46ef9eb09c13baf91fb909868a5d920d5a371e37a0f SHA512 a6d45b8c1043d94493fea3abebfb6fd1d690192243739d98f0df8a3e89de3a07da17078bce9797373c2312a249cdfbee41c952f1acce4e17c5cfd12e02d7267f
+DIST nltk-senseval-20200312.zip 2151350 BLAKE2B ec21463f7ffe27e3dda47e38e356346491ce64412f6d4f537fb73e72a0d6dad2667d75895add461dfebc729d44b091365020f7e9c93c7e7bcddb00c74f90ee30 SHA512 4c69145668cbc9d25d4d6893caea417258fc34c23527538f5799d96da19f45aeb271d08f48ea434ee21b7cb499579631556aa517c294743bc64e0f840854ba9b
+DIST nltk-sentence_polarity-20200312.zip 490256 BLAKE2B 56a1b4668875d5c73b220710e8af84410a273f394d68d7e9decb47c5370f0f7c1982d37710ab903d78dac0108ec2e1a26e8f1cdd3fee172487397c8b2ae01ee7 SHA512 d3772edc13d46fb835f40a9c6714b95885d3ba2eded37a24c8aad6d3e63ca49a9c79387218e2f67bda2e52d52449daf1dd1e603a4819ae46da78f4e0a38ba599
+DIST nltk-sentiwordnet-20200312.zip 4686546 BLAKE2B 98a45d9a5be971bd9e76ad5c154359c1800aa96ab1e8e301215e435cae6d659cc0fb38604435da4e9a391741a72ef7fcb842a1cf119348a7039f717843035f12 SHA512 cd0ff31abec3b85e3114546918c9152360e3da45fbbaf7b4c76a13a6d288f1d9ea7a4633638080cd0205cc293b4123445106e9b5daddb2ae379d6a08d1c992a5
+DIST nltk-shakespeare-20200312.zip 475458 BLAKE2B c60acbf2d90f5fffdae43bdf85191dbd694ec5ae19e37b145ad04af6d8481988f0c2ce644baff2e31707e86d95a9b3b03c1da4faaa46be92da2291cce565955e SHA512 72cbbdb237b78df1f6bf27b5504606f805433746b9862dd1ce544ab712783a41fb165f3a4f282a7f1bd58c7dfe6036c14ecb07a82c109f6d326eb95e470002fb
+DIST nltk-sinica_treebank-20200312.zip 899237 BLAKE2B e074dbdc7f1fe5958109f8ea019bead946bd2e3cd1c0849cf3ad49eaa34a1989baee7935a70e9103c121e5d375954d1dc53e4a1599e394ce6c2fda4e98769645 SHA512 724fb0905cbd0f903c3a75d70aee8a3e6f29e62971c1501efd8bfb3f4c6619e5fd8a285648e7792ea8e86bbda0a2afc6eecae45ad9f361983bf83b756ab52960
+DIST nltk-smultron-20200312.zip 166207 BLAKE2B d0c3e75dd108965e260d913e0c02137da8707a751a4f53a1f59ff6a26af9997e2f0193980b5087277b87b74d9aab3e3c8d066d7264311cec2a94f0f9d21caf44 SHA512 8872717994269caff4ccf3f52cef273b2ef007efeb8ab89392cb4db3b8e3ba4b8859ab2fd70762b8941bee4558aebba3021869772be17a64e1056968aa605e19
+DIST nltk-snowball_data-20200312.zip 6785405 BLAKE2B 44c10439b142540ac7eece967efa1431fd8f45342f0a90875dacf29ad374fe4c7d30af11d42ba45e0f1ec1836d56b2ff684ee352c5e8536cfb5db5eb7632285b SHA512 6c8a9259d88f6f7f499867d83b731de99d7fa4e8827ecedf836f653fc1a810efa9f6c5c6e2720a9e6610bc00978956b6a119bd08b70e3e241c4e9faccddd81d8
+DIST nltk-spanish_grammars-20200312.zip 4047 BLAKE2B d8a8dbb558850a6a60f1fe5ab0f617f3a0f3c64bc7d49980cf793d374c6679d1bd42afb7e61776737b5eec162f2520abf2ee3acc92ea9ee0f397c3089b3b5b28 SHA512 4513347156e9351c259c0e2448198d68354bbc95e0a54561c31a88f13f333ebcba3e294c820cb62036665f2904eb6a7137546cd580e361c0423c30a8aed950b2
+DIST nltk-state_union-20200312.zip 808757 BLAKE2B 2d352af0ced736d3c11a821eaba0b035b3b5b6b0f20db3bd5d4ac2451f99daf68bffd3ad984bd404ecc4a1e67ca9281c529af2cc9e295a3a7330f36ee9640bc8 SHA512 a6fc83e6230e57ba66a7af62f0d2a5d44a14530ed1e0e914590b3f8b5bf939967c126a5e6f6899ba9134843893f65212e836d311109393c1200191a5c3163485
+DIST nltk-stopwords-20200312.zip 23047 BLAKE2B a0677cc0d4a3d54ee6e27eab8fb7635d6cd29265204896870e57457a54459f1d6cabc0c4e76e749397f5eee299eed0b524fcbc2033ea17d81cdb6cd98c5ed968 SHA512 31774fd3db2e0fba0209db71c08f9b2d971311ea4a59739cfdc0a9ae34f6c6c593f2a991a14ee20b0de8b380215e609f8398ed50c546775322ab8c4c3f8d06dd
+DIST nltk-subjectivity-20200312.zip 521628 BLAKE2B 0a8777a5b91b1b825fbde41cc927d496480129f0a810349bddde2036ed16f37611f2bc3b007e74fe36523612529a92433d32d094be72d247f5faef8220c3c491 SHA512 a3cc4d2d20f26c5eabd86fbced2c013e69d46e607013278eb35831a62e57523a17aec1b580ba62c7a867e61a561e1b222d8430f0c1e2d429a9479e12b008c5b7
+DIST nltk-swadesh-20200312.zip 22828 BLAKE2B 1cd9d5355c6b53694ec545ca001b0807b4912a7878ba075b0f81ce8b9b22a5c7a18cf52cf2449483a1c89cd20d8d86986dc3d827fa93a7ef5824fddbc0922025 SHA512 90cb32532a5378d05ce34b84b5f8363dbb32f24afac58b0dcc5cdccba98fd7d37def7f4fbe76b11c8c64059bd19df745562bfbf5f4c721d65ce9f4be1348ab76
+DIST nltk-switchboard-20200312.zip 791161 BLAKE2B 211116a751ae246fa31b6aca96b396d3642d89ad112588a09f8d91a5b76dc41c7fb4d36c16c6358cd8e0da8056bc83598ed0dc635cf7b1fd8469a0e80b5f1761 SHA512 690e5392dc082c4ac550bde2848aa65117e7a25cbc4bce0887581c531d03be64e21f044ac0a3286648255f0edd7766b1161f5575ad5fd680c7303b34c3226b8f
+DIST nltk-tagsets-20200312.zip 34531 BLAKE2B 7966055f926f33cba5929f16623afdb232c6168049e07728fc6629641eb5e8bbeb47fa4c2e3ebb89b7f5f49855565ebdef17c9a9ac7e7cad58fd42ab53d5c333 SHA512 03a5a9643c3f242ae699f4d2f5269aacd167e1ddfdb826759dc5a51271e37efb89ac7d3d316e7573b00bb92ed2138c8c72cb0b67b7f28ca8ac93ed0a6d48c798
+DIST nltk-timit-20200312.zip 22251869 BLAKE2B a4ab4f5c1973bd748893f15ac47eedf7e1bc6de1ea40caa9e5687535a5e2e4f159ad98aed5b2ca78133146b76fc593e279502d3cd238fa3a96c5825d8b31d773 SHA512 0f7f853dec95431f2c3f404ec275a06ae43c3d4a506e787f94689521b381bc9cb4b76fc93b9829bcdc3265f8465e11e0ddc2e0d8a36c88b544dec21d679b9b7f
+DIST nltk-toolbox-20200312.zip 250616 BLAKE2B 34b0bda4cd229ab8beada332e43a1040a4cde652129b70b06e8409bf467423af6f15c8001ee9b162cd2714c9ac556e6d0c484572b36c85126851d1299e676e66 SHA512 33f50ff645bc359894de098000e117d7a077e9422a3318524b06bb895c10c33a583faa134d79f620ec95ef5b3ba9d502d15e68b531e6429845181da9b163307e
+DIST nltk-treebank-20200312.zip 1740034 BLAKE2B cd40ad3df05dae4d5e4f7a3a4154eb4ac239b0f41b1798da72e023c7878afb549d29ddba73111dba1e2f34554538f1f949d2622363deb1db9a8f5ac1b4dd455a SHA512 f2b3793329259c16b2544cfb544270e9c4b4d25eccbd997c086d3d5915981ceb038f719c097d4c71429aa9e7c4f0e19918db06b249166a2f6153d6b618c96a77
+DIST nltk-twitter_samples-20200312.zip 16007673 BLAKE2B dc100181ff129c2e9774c643cd0841220496443902ba57818c327929782b4d6e0434ac083c47cd362401d4aa804635fd2e008342755f1cd7962f23e0bb57e3f6 SHA512 1c950e0135e8da58540ca618a1d52f0359afde44b429bede4005182924736d3edbc0fe52638a2101664490fe7e2eb25d7ab704909b241b9349b8c5aa7a185892
+DIST nltk-udhr-20200312.zip 1170177 BLAKE2B cab5f8b4ffa9c0c303620a499f61bbe654545a5689aaa710df743054836a7843e826d97ee91f7b84ba2dcd32ffce432f1932c53797e791de2776eaf183870988 SHA512 730f83cb663974edc099427fad0a3c1cd16769f5ef82446a78aa0e12f01b68a4c52037bdb5e3717dd6b08c42655b03d9d0f2f5869c01b117ae2ab8a428df0dcc
+DIST nltk-udhr2-20200312.zip 1653975 BLAKE2B 43ee8c9c490a6a8ee5f0d9273bab83f77ba3d977da7eaeded3239b8c16f3432ef36478d0d290b1df9fe9af4e0c5de1902b8fe699788b7681003e5c8c426574e3 SHA512 eda644aedba1d0b9d2605dc6b1c8c77943692b4404c4a9c904c82956432c9ba7e27bb0f496e63d3eef7f29b4b34eb929f8d9228b4fd56613d5d4f743fa1d87e0
+DIST nltk-unicode_samples-20200312.zip 1212 BLAKE2B 715c6030baf56d766dce7b920500c29dc98ce4edc9b41fb4c858960a4630469fe5a269be6d0b2d33b54ee0208014b1e43f5d9ec49b878442963ea21e4e3acda6 SHA512 88cac1ef0f43fc491285c0638807f88f52d1baba8964cb2cf37c4ee3e8cd8c080071f108fb553fdc14ed17cb0e80343097c08ac7c1613b0abbd84f559e38fba8
+DIST nltk-universal_tagset-20200312.zip 19095 BLAKE2B 179297dac3a6a30a4a55e8affb480cc95a7a1dd1ee0ac6a0d4381c655c5dbf065e16f750ebd28846299f114d3355a9edf6cbed72242ebf5fd257f339fcd8f40d SHA512 ca8915e8c3bcb8515ffe0da21fb7066d213683ff702bf247d9fa9f7c966c66d06facafef1d600b7223d7aa1e4dfba1a2a75e1bb05b2c5d78beb312b8551d3fed
+DIST nltk-universal_treebanks_v20-20200312.zip 25908853 BLAKE2B 20f8a6c0e759ab15560e71f48527e26a0dac554b8996dcebcacd2c5d5e3cea8947481734a03680843176025fde6d5e484e9414dd239043dca380c79053d71d8f SHA512 3dcabe02ff8bb1044f7d9e1c5af294af05291ff33a6c8dc16d3a3c855ccbf3a8261d92ea926e15e0d7f9228ae81e7863a283b156812ad30575c9ea947ecbcc55
+DIST nltk-vader_lexicon-20200312.zip 90486 BLAKE2B aaa4a07a600d6ef53373250afd62a1a5e76cbeb28b56c06879d07a6cea593d7041300e6eaf2ce39b87dc0c04a536fae111df80604e7f517878e680fb9db16019 SHA512 55d41757736806a2a0dbcaedef28000b4461eb44dce308c9e06315feb55f843914f743c37b6cf220fc9e61c04f8f7f41dd338438059e2e803fc2b4e91f739923
+DIST nltk-verbnet-20200312.zip 323661 BLAKE2B af5b26a63853d1bc3fc99c9cba3c50a82bc148cb6d95a26045c9aa29ba83408db76b098354e99f07696d5e2373ed9d8a35ed8504ac9b20e1b4d16252a238bc85 SHA512 00368fb2d5cf1bf818f1d72762d2c40a98067937d71a65ece2ca0f4bfab6f8957b32ba66741901af34258cf3309d00dae433b1092d0b7adb145a791dc3bf859b
+DIST nltk-verbnet3-20200312.zip 482025 BLAKE2B 256b922160c6c8288d1a80d2eaab686988e1bdac2c7cd04b99f1662815e77881d97d4655a22480f3c3ba5fa5117d1098f416e5c460ca725e299e94a5e9bbd12b SHA512 aa8f0eab8d132d30039ae93d077970183ea87e26b3918b9760cd9c86cfe2a8b09e49569f8c4e12af9c3b964d9d341dfe10772377843ae9b6d888a439acf63268
+DIST nltk-webtext-20200312.zip 646297 BLAKE2B ca072fc38c144b659c76c36c9161641c91f814fd84e5cb8bebe5dfe9382e5813bfaafafba2cb8b2e23d014f071c5b5a9589b78a2028418ad61147b0de35b95bc SHA512 7c88ff4920d2b61bf7e7cf8265100d8781c061c0742852a8b49142eb9a4d48e34ecc0c6257cbb7d7c2e7ea7ee87e942305b03451522776fe09a91ca86c2fbe4b
+DIST nltk-wmt15_eval-20200312.zip 383096 BLAKE2B 119943db4240171077569b3302c678644c2c9547ba67bfd055751059e0a3ad3ab6a19e4eedf9108d313d46dcd36cf19e11d973981da8c70a01c4cb790a7bd739 SHA512 362395d4c77ebe92f4c19fa8c2000082dc7a2343acc19cccb596ca00db6c40c231b904d807f46e2691cc4c4a0c79d14873b2a1983a494f2ca1485d540d787ceb
+DIST nltk-word2vec_sample-20200312.zip 49396025 BLAKE2B 0512b9bb7121a528190079f578e82b4e8f8021bfc0062cfa5613d260f3eee17460aecfeaacb65d950e79d27a653c78633a88c3638ec16377e2dbc3006387ebaa SHA512 09c30a4ab8f9fb6a5b36974b5953260d01cb4f285827fb90a374d054ad775ac978602ab56c452f46f4f8601312e232fd739d2f54dafa44ab8b7b01831cf0d9d1
+DIST nltk-wordnet-20200312.zip 10775600 BLAKE2B dac56a8fb1fa6882b1871c394ad2acb2d3be739c424570e27c89fb6983df5f896a8f359092ba82752ddfc0531d83563a219e85f80124202f29bda93181efe4dc SHA512 1923a8bcd56fa0b9a9de91f53070dce28c3a7efbab11d2ef55c87134b1bf30de0f40abab59c39eb15dce54aec9491d8a5a259de212ff4cb25cde0ad09317009a
+DIST nltk-wordnet_ic-20200312.zip 12056682 BLAKE2B c2dc2a646015b23699a72f636b588ec5718c70e6941d9d56863257e1e0396c8cf59ac1dc6ed74e5d7f0c2ee9129d63221a03967bf66a3d335e99160f295ed44d SHA512 1c94451a13af6c76bff60a0cab2e70402a3d9abd2e8fc62a5473f24ab4229feb0afe4faa8d389734697a6cf86d2c8b1dc700bb3afa3cbc279b75d7e0ec19fc6d
+DIST nltk-words-20200312.zip 757777 BLAKE2B eaaaaab6c26e206e9b6ce45daf779e3cc6706a06132afeabf013026d0009caee2d678f3c4ea9125b9654f7143bef29ec7a5706b79e5650ea556c6821b7754e6f SHA512 2810f05d3fc7ee6b6f8636fa1ff7b4e8c8cdac12b415cc54d15c69102290122ea138ec4fa36cb483f790c1ac10b0f83ae4c2c3e0e8df7e67e90e962ee5dbb0be
+DIST nltk-ycoe-20200312.zip 477 BLAKE2B 574835aa011a06a06363e26facd6a6f583a1dc1cac2de39adff59d8ab48eefac030b43d935a2f79af855259f2a9a571193dae2811589483af97406ff05c76c9e SHA512 e39ce165074d10ff63cb84ea52905d7ecb937797c8123ed113c5609afe1f63ac44d04d48a681002c4eac21dc9076ac74164b886c6f9ce42f3a102c38d1e8e756
diff --git a/dev-python/nltk-data/metadata.xml b/dev-python/nltk-data/metadata.xml
new file mode 100644
index 00000000000..6fae393f618
--- /dev/null
+++ b/dev-python/nltk-data/metadata.xml
@@ -0,0 +1,14 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<!DOCTYPE pkgmetadata SYSTEM "http://www.gentoo.org/dtd/metadata.dtd">
+<pkgmetadata>
+ <maintainer type="project">
+ <email>python@gentoo.org</email>
+ <name>Python</name>
+ </maintainer>
+ <use>
+ <flag name="extra">
+ Install the complete set of data files. Otherwise, only
+ files required by tests are installed.
+ </flag>
+ </use>
+</pkgmetadata>
diff --git a/dev-python/nltk-data/nltk-data-20200312.ebuild b/dev-python/nltk-data/nltk-data-20200312.ebuild
new file mode 100644
index 00000000000..1d02afe6be7
--- /dev/null
+++ b/dev-python/nltk-data/nltk-data-20200312.ebuild
@@ -0,0 +1,184 @@
+# Copyright 2020 Gentoo Authors
+# Distributed under the terms of the GNU General Public License v2
+
+EAPI=7
+
+inherit check-reqs
+
+DESCRIPTION="Data files for NLTK"
+HOMEPAGE="https://www.nltk.org/nltk_data/"
+
+# at least some of the files have poorly documented licenses
+# TODO: create a USE flag for free-ish subset
+LICENSE="all-rights-reserved"
+SLOT="0"
+KEYWORDS="~amd64 ~x86"
+IUSE="extra"
+RESTRICT="bindist mirror"
+
+BDEPEND="app-arch/unzip"
+
+PACKAGES_ZIP=(
+ # wget -O - https://www.nltk.org/nltk_data/ | xml sel -t -m '//package[@unzip=0]' -v @subdir -o "/" -v @id -n - | sort
+ corpora/comtrans
+ corpora/conll2007
+ corpora/jeita
+ corpora/knbc
+ corpora/machado
+ corpora/masc_tagged
+ corpora/nombank.1.0
+ corpora/panlex_swadesh
+ corpora/propbank
+ corpora/reuters
+ corpora/semcor
+ corpora/universal_treebanks_v20
+ sentiment/vader_lexicon
+ stemmers/snowball_data
+)
+
+PACKAGES_UNPACK=(
+ # wget -O - https://www.nltk.org/nltk_data/ | xml sel -t -m '//package[@unzip=1]' -v @subdir -o "/" -v @id -n - | sort
+ corpora/abc
+ corpora/alpino
+ corpora/brown
+ corpora/cess_cat
+ corpora/cess_esp
+ corpora/chat80
+ corpora/city_database
+ corpora/cmudict
+ corpora/comparative_sentences
+ corpora/conll2000
+ corpora/conll2002
+ corpora/crubadan
+ corpora/dependency_treebank
+ corpora/dolch
+ corpora/europarl_raw
+ corpora/floresta
+ corpora/framenet_v15
+ corpora/framenet_v17
+ corpora/gazetteers
+ corpora/genesis
+ corpora/gutenberg
+ corpora/ieer
+ corpora/inaugural
+ corpora/indian
+ corpora/lin_thesaurus
+ corpora/mac_morpho
+ corpora/movie_reviews
+ corpora/mte_teip5
+ corpora/names
+ corpora/nonbreaking_prefixes
+ corpora/nps_chat
+ corpora/omw
+ corpora/opinion_lexicon
+ corpora/ppattach
+ corpora/product_reviews_1
+ corpora/product_reviews_2
+ corpora/pros_cons
+ corpora/ptb
+ corpora/qc
+ corpora/rte
+ corpora/senseval
+ corpora/sentence_polarity
+ corpora/sentiwordnet
+ corpora/shakespeare
+ corpora/sinica_treebank
+ corpora/state_union
+ corpora/stopwords
+ corpora/subjectivity
+ corpora/swadesh
+ corpora/switchboard
+ corpora/timit
+ corpora/toolbox
+ corpora/treebank
+ corpora/twitter_samples
+ corpora/udhr
+ corpora/udhr2
+ corpora/verbnet
+ corpora/webtext
+ corpora/wordnet
+ corpora/wordnet_ic
+ corpora/words
+ grammars/book_grammars
+ grammars/large_grammars
+ grammars/sample_grammars
+ misc/perluniprops
+ models/bllip_wsj_no_aux
+ models/moses_sample
+ models/wmt15_eval
+ models/word2vec_sample
+ stemmers/porter_test
+ stemmers/rslp
+ taggers/averaged_perceptron_tagger
+ taggers/averaged_perceptron_tagger_ru
+ taggers/universal_tagset
+ tokenizers/punkt
+)
+
+PACKAGES_UNPACK_EXTRA=(
+ chunkers/maxent_ne_chunker
+ corpora/biocreative_ppi
+ corpora/brown_tei
+ corpora/kimmo
+ corpora/paradigms
+ corpora/pe08
+ corpora/pil
+ corpora/pl196x
+ corpora/problem_reports
+ corpora/smultron
+ corpora/unicode_samples
+ corpora/verbnet3
+ corpora/ycoe
+ grammars/basque_grammars
+ grammars/spanish_grammars
+ help/tagsets
+ misc/mwa_ppdb
+ taggers/maxent_treebank_pos_tagger
+)
+
+add_data() {
+ local x
+ for x; do
+ SRC_URI+="
+ https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages/${x}.zip
+ -> nltk-${x#*/}-${PV}.zip"
+ done
+}
+
+add_data "${PACKAGES_ZIP[@]}" "${PACKAGES_UNPACK[@]}"
+SRC_URI+="
+ extra? ("
+add_data "${PACKAGES_UNPACK_EXTRA[@]}"
+SRC_URI+="
+ )"
+
+CHECKREQS_DISK_USR=3G
+CHECKREQS_DISK_BUILD=${CHECKREQS_DISK_USR}
+
+src_unpack() {
+ local x
+ local to_unpack=( "${PACKAGES_UNPACK[@]}" )
+ use extra && to_unpack+=( "${PACKAGES_UNPACK_EXTRA[@]}" )
+ for x in "${to_unpack[@]}"; do
+ local cat=${x%/*}
+ local pkg=${x#*/}
+
+ mkdir -p "${S}/${cat}" || die
+ cd "${S}/${cat}" || die
+ unpack "nltk-${pkg}-${PV}.zip"
+ done
+}
+
+src_install() {
+ dodir /usr/share/nltk_data
+ mv * "${ED}/usr/share/nltk_data/" || die
+
+ local x
+ for x in "${PACKAGES_ZIP[@]}"; do
+ local cat=${x%/*}
+ local pkg=${x#*/}
+
+ insinto "/usr/share/nltk_data/${cat}"
+ newins "${DISTDIR}/nltk-${pkg}-${PV}.zip" "${pkg}.zip"
+ done
+}
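Both src_unpack and src_install above split each "category/package" entry with two parameter expansions. What follows is only a sketch of that splitting, runnable outside Portage. The helper names split_entry and install_target are hypothetical and do not appear in the ebuild. The target path is what the insinto/newins pair appears to produce.

```shell
# Illustrative sketch only: split_entry and install_target are hypothetical
# helpers, not functions from the ebuild.

# Print the category and the package name of an entry such as "corpora/abc".
split_entry() {
	local x="${1}"
	local cat="${x%/*}"   # drop the shortest suffix matching "/*"
	local pkg="${x#*/}"   # drop the shortest prefix matching "*/"
	printf '%s %s\n' "${cat}" "${pkg}"
}

# Print where a zip from PACKAGES_ZIP would be expected to land.
install_target() {
	local x="${1}"
	printf '/usr/share/nltk_data/%s/%s.zip\n' "${x%/*}" "${x#*/}"
}

split_entry corpora/abc
install_target sentiment/vader_lexicon
```

Entries whose names contain dots, such as corpora/nombank.1.0, pass through unchanged because only the slash is used as a separator.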
* [gentoo-commits] repo/gentoo:master commit in: dev-python/nltk-data/
@ 2020-04-13 10:32 Michał Górny
From: Michał Górny @ 2020-04-13 10:32 UTC
To: gentoo-commits
commit: d2046bcd67c169466279d561c5a69006f2faa74d
Author: Michał Górny <mgorny <AT> gentoo <DOT> org>
AuthorDate: Mon Apr 13 09:26:40 2020 +0000
Commit: Michał Górny <mgorny <AT> gentoo <DOT> org>
CommitDate: Mon Apr 13 10:32:14 2020 +0000
URL: https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=d2046bcd
dev-python/nltk-data: Install corpora/pl196x unconditionally
Install corpora/pl196x unconditionally as it is required
for dev-python/nltk-3.5 tests.
Signed-off-by: Michał Górny <mgorny <AT> gentoo.org>
dev-python/nltk-data/nltk-data-20200312-r1.ebuild | 184 ++++++++++++++++++++++
1 file changed, 184 insertions(+)
diff --git a/dev-python/nltk-data/nltk-data-20200312-r1.ebuild b/dev-python/nltk-data/nltk-data-20200312-r1.ebuild
new file mode 100644
index 00000000000..c8f12c580e6
--- /dev/null
+++ b/dev-python/nltk-data/nltk-data-20200312-r1.ebuild
@@ -0,0 +1,184 @@
+# Copyright 2020 Gentoo Authors
+# Distributed under the terms of the GNU General Public License v2
+
+EAPI=7
+
+inherit check-reqs
+
+DESCRIPTION="Data files for NLTK"
+HOMEPAGE="https://www.nltk.org/nltk_data/"
+
+# at least some of the files have poorly documented licenses
+# TODO: create a USE flag for free-ish subset
+LICENSE="all-rights-reserved"
+SLOT="0"
+KEYWORDS="~amd64 ~x86"
+IUSE="extra"
+RESTRICT="bindist mirror"
+
+BDEPEND="app-arch/unzip"
+
+PACKAGES_ZIP=(
+ # wget -O - https://www.nltk.org/nltk_data/ | xml sel -t -m '//package[@unzip=0]' -v @subdir -o "/" -v @id -n - | sort
+ corpora/comtrans
+ corpora/conll2007
+ corpora/jeita
+ corpora/knbc
+ corpora/machado
+ corpora/masc_tagged
+ corpora/nombank.1.0
+ corpora/panlex_swadesh
+ corpora/propbank
+ corpora/reuters
+ corpora/semcor
+ corpora/universal_treebanks_v20
+ sentiment/vader_lexicon
+ stemmers/snowball_data
+)
+
+PACKAGES_UNPACK=(
+ # wget -O - https://www.nltk.org/nltk_data/ | xml sel -t -m '//package[@unzip=1]' -v @subdir -o "/" -v @id -n - | sort
+ corpora/abc
+ corpora/alpino
+ corpora/brown
+ corpora/cess_cat
+ corpora/cess_esp
+ corpora/chat80
+ corpora/city_database
+ corpora/cmudict
+ corpora/comparative_sentences
+ corpora/conll2000
+ corpora/conll2002
+ corpora/crubadan
+ corpora/dependency_treebank
+ corpora/dolch
+ corpora/europarl_raw
+ corpora/floresta
+ corpora/framenet_v15
+ corpora/framenet_v17
+ corpora/gazetteers
+ corpora/genesis
+ corpora/gutenberg
+ corpora/ieer
+ corpora/inaugural
+ corpora/indian
+ corpora/lin_thesaurus
+ corpora/mac_morpho
+ corpora/movie_reviews
+ corpora/mte_teip5
+ corpora/names
+ corpora/nonbreaking_prefixes
+ corpora/nps_chat
+ corpora/omw
+ corpora/opinion_lexicon
+ corpora/pl196x
+ corpora/ppattach
+ corpora/product_reviews_1
+ corpora/product_reviews_2
+ corpora/pros_cons
+ corpora/ptb
+ corpora/qc
+ corpora/rte
+ corpora/senseval
+ corpora/sentence_polarity
+ corpora/sentiwordnet
+ corpora/shakespeare
+ corpora/sinica_treebank
+ corpora/state_union
+ corpora/stopwords
+ corpora/subjectivity
+ corpora/swadesh
+ corpora/switchboard
+ corpora/timit
+ corpora/toolbox
+ corpora/treebank
+ corpora/twitter_samples
+ corpora/udhr
+ corpora/udhr2
+ corpora/verbnet
+ corpora/webtext
+ corpora/wordnet
+ corpora/wordnet_ic
+ corpora/words
+ grammars/book_grammars
+ grammars/large_grammars
+ grammars/sample_grammars
+ misc/perluniprops
+ models/bllip_wsj_no_aux
+ models/moses_sample
+ models/wmt15_eval
+ models/word2vec_sample
+ stemmers/porter_test
+ stemmers/rslp
+ taggers/averaged_perceptron_tagger
+ taggers/averaged_perceptron_tagger_ru
+ taggers/universal_tagset
+ tokenizers/punkt
+)
+
+PACKAGES_UNPACK_EXTRA=(
+ chunkers/maxent_ne_chunker
+ corpora/biocreative_ppi
+ corpora/brown_tei
+ corpora/kimmo
+ corpora/paradigms
+ corpora/pe08
+ corpora/pil
+ corpora/problem_reports
+ corpora/smultron
+ corpora/unicode_samples
+ corpora/verbnet3
+ corpora/ycoe
+ grammars/basque_grammars
+ grammars/spanish_grammars
+ help/tagsets
+ misc/mwa_ppdb
+ taggers/maxent_treebank_pos_tagger
+)
+
+add_data() {
+ local x
+ for x; do
+ SRC_URI+="
+ https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages/${x}.zip
+ -> nltk-${x#*/}-${PV}.zip"
+ done
+}
+
+add_data "${PACKAGES_ZIP[@]}" "${PACKAGES_UNPACK[@]}"
+SRC_URI+="
+ extra? ("
+add_data "${PACKAGES_UNPACK_EXTRA[@]}"
+SRC_URI+="
+ )"
+
+CHECKREQS_DISK_USR=3G
+CHECKREQS_DISK_BUILD=${CHECKREQS_DISK_USR}
+
+src_unpack() {
+ local x
+ local to_unpack=( "${PACKAGES_UNPACK[@]}" )
+ use extra && to_unpack+=( "${PACKAGES_UNPACK_EXTRA[@]}" )
+ for x in "${to_unpack[@]}"; do
+ local cat=${x%/*}
+ local pkg=${x#*/}
+
+ mkdir -p "${S}/${cat}" || die
+ cd "${S}/${cat}" || die
+ unpack "nltk-${pkg}-${PV}.zip"
+ done
+}
+
+src_install() {
+ dodir /usr/share/nltk_data
+ mv * "${ED}/usr/share/nltk_data/" || die
+
+ local x
+ for x in "${PACKAGES_ZIP[@]}"; do
+ local cat=${x%/*}
+ local pkg=${x#*/}
+
+ insinto "/usr/share/nltk_data/${cat}"
+ newins "${DISTDIR}/nltk-${pkg}-${PV}.zip" "${pkg}.zip"
+ done
+}
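The effect of moving corpora/pl196x out of PACKAGES_UNPACK_EXTRA can be sketched by mimicking how SRC_URI is assembled. This is plain shell, not ebuild code. The add_data here is a simplified stand-in using string concatenation, BASE is a name introduced for the sketch, PV is set by hand (Portage would normally derive it from the ebuild file name), and only two entries are listed.

```shell
# Illustrative sketch only: simplified stand-in for the ebuild's add_data.
SRC_URI=""
BASE="https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages"
PV=20200312   # assumption: set by hand for this sketch

add_data() {
	local x
	for x; do
		SRC_URI="${SRC_URI} ${BASE}/${x}.zip -> nltk-${x#*/}-${PV}.zip"
	done
}

# Unconditional part: corpora/pl196x is listed here after the change.
add_data corpora/pl196x
# USE-conditional part: fetched only when the "extra" flag is enabled.
SRC_URI="${SRC_URI} extra? ("
add_data corpora/pil
SRC_URI="${SRC_URI} )"

printf '%s\n' "${SRC_URI}"
```

In the printed string the pl196x entry appears before the "extra? (" group, so it is fetched regardless of the flag.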
* [gentoo-commits] repo/gentoo:master commit in: dev-python/nltk-data/
@ 2020-05-04 11:26 Michał Górny
From: Michał Górny @ 2020-05-04 11:26 UTC
To: gentoo-commits
commit: d6d068bfb0046fad9c2b2ddf686c69721ec6f4a5
Author: Michał Górny <mgorny <AT> gentoo <DOT> org>
AuthorDate: Mon May 4 11:26:10 2020 +0000
Commit: Michał Górny <mgorny <AT> gentoo <DOT> org>
CommitDate: Mon May 4 11:26:10 2020 +0000
URL: https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=d6d068bf
dev-python/nltk-data: Remove old
Signed-off-by: Michał Górny <mgorny <AT> gentoo.org>
dev-python/nltk-data/nltk-data-20200312.ebuild | 184 -------------------------
1 file changed, 184 deletions(-)
diff --git a/dev-python/nltk-data/nltk-data-20200312.ebuild b/dev-python/nltk-data/nltk-data-20200312.ebuild
deleted file mode 100644
index 1d02afe6be7..00000000000
--- a/dev-python/nltk-data/nltk-data-20200312.ebuild
+++ /dev/null
@@ -1,184 +0,0 @@
-# Copyright 2020 Gentoo Authors
-# Distributed under the terms of the GNU General Public License v2
-
-EAPI=7
-
-inherit check-reqs
-
-DESCRIPTION="Data files for NLTK"
-HOMEPAGE="https://www.nltk.org/nltk_data/"
-
-# at least some of the files have poorly documented licenses
-# TODO: create a USE flag for free-ish subset
-LICENSE="all-rights-reserved"
-SLOT="0"
-KEYWORDS="~amd64 ~x86"
-IUSE="extra"
-RESTRICT="bindist mirror"
-
-BDEPEND="app-arch/unzip"
-
-PACKAGES_ZIP=(
- # wget -O - https://www.nltk.org/nltk_data/ | xml sel -t -m '//package[@unzip=0]' -v @subdir -o "/" -v @id -n - | sort
- corpora/comtrans
- corpora/conll2007
- corpora/jeita
- corpora/knbc
- corpora/machado
- corpora/masc_tagged
- corpora/nombank.1.0
- corpora/panlex_swadesh
- corpora/propbank
- corpora/reuters
- corpora/semcor
- corpora/universal_treebanks_v20
- sentiment/vader_lexicon
- stemmers/snowball_data
-)
-
-PACKAGES_UNPACK=(
- # wget -O - https://www.nltk.org/nltk_data/ | xml sel -t -m '//package[@unzip=1]' -v @subdir -o "/" -v @id -n - | sort
- corpora/abc
- corpora/alpino
- corpora/brown
- corpora/cess_cat
- corpora/cess_esp
- corpora/chat80
- corpora/city_database
- corpora/cmudict
- corpora/comparative_sentences
- corpora/conll2000
- corpora/conll2002
- corpora/crubadan
- corpora/dependency_treebank
- corpora/dolch
- corpora/europarl_raw
- corpora/floresta
- corpora/framenet_v15
- corpora/framenet_v17
- corpora/gazetteers
- corpora/genesis
- corpora/gutenberg
- corpora/ieer
- corpora/inaugural
- corpora/indian
- corpora/lin_thesaurus
- corpora/mac_morpho
- corpora/movie_reviews
- corpora/mte_teip5
- corpora/names
- corpora/nonbreaking_prefixes
- corpora/nps_chat
- corpora/omw
- corpora/opinion_lexicon
- corpora/ppattach
- corpora/product_reviews_1
- corpora/product_reviews_2
- corpora/pros_cons
- corpora/ptb
- corpora/qc
- corpora/rte
- corpora/senseval
- corpora/sentence_polarity
- corpora/sentiwordnet
- corpora/shakespeare
- corpora/sinica_treebank
- corpora/state_union
- corpora/stopwords
- corpora/subjectivity
- corpora/swadesh
- corpora/switchboard
- corpora/timit
- corpora/toolbox
- corpora/treebank
- corpora/twitter_samples
- corpora/udhr
- corpora/udhr2
- corpora/verbnet
- corpora/webtext
- corpora/wordnet
- corpora/wordnet_ic
- corpora/words
- grammars/book_grammars
- grammars/large_grammars
- grammars/sample_grammars
- misc/perluniprops
- models/bllip_wsj_no_aux
- models/moses_sample
- models/wmt15_eval
- models/word2vec_sample
- stemmers/porter_test
- stemmers/rslp
- taggers/averaged_perceptron_tagger
- taggers/averaged_perceptron_tagger_ru
- taggers/universal_tagset
- tokenizers/punkt
-)
-
-PACKAGES_UNPACK_EXTRA=(
- chunkers/maxent_ne_chunker
- corpora/biocreative_ppi
- corpora/brown_tei
- corpora/kimmo
- corpora/paradigms
- corpora/pe08
- corpora/pil
- corpora/pl196x
- corpora/problem_reports
- corpora/smultron
- corpora/unicode_samples
- corpora/verbnet3
- corpora/ycoe
- grammars/basque_grammars
- grammars/spanish_grammars
- help/tagsets
- misc/mwa_ppdb
- taggers/maxent_treebank_pos_tagger
-)
-
-add_data() {
- local x
- for x; do
- SRC_URI+="
- https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages/${x}.zip
- -> nltk-${x#*/}-${PV}.zip"
- done
-}
-
-add_data "${PACKAGES_ZIP[@]}" "${PACKAGES_UNPACK[@]}"
-SRC_URI+="
- extra? ("
-add_data "${PACKAGES_UNPACK_EXTRA[@]}"
-SRC_URI+="
- )"
-
-CHECKREQS_DISK_USR=3G
-CHECKREQS_DISK_BUILD=${CHECKREQS_DISK_USR}
-
-src_unpack() {
- local x
- local to_unpack=( "${PACKAGES_UNPACK[@]}" )
- use extra && to_unpack+=( "${PACKAGES_UNPACK_EXTRA[@]}" )
- for x in "${to_unpack[@]}"; do
- local cat=${x%/*}
- local pkg=${x#*/}
-
- mkdir -p "${S}/${cat}" || die
- cd "${S}/${cat}" || die
- unpack "nltk-${pkg}-${PV}.zip"
- done
-}
-
-src_install() {
- dodir /usr/share/nltk_data
- mv * "${ED}/usr/share/nltk_data/" || die
-
- local x
- for x in "${PACKAGES_ZIP[@]}"; do
- local cat=${x%/*}
- local pkg=${x#*/}
-
- insinto "/usr/share/nltk_data/${cat}"
- newins "${DISTDIR}/nltk-${pkg}-${PV}.zip" "${pkg}.zip"
- done
-}
* [gentoo-commits] repo/gentoo:master commit in: dev-python/nltk-data/
@ 2020-05-09 11:24 Mikle Kolyada
From: Mikle Kolyada @ 2020-05-09 11:24 UTC
To: gentoo-commits
commit: 86e8654f99e728e58d3a639baf78e190baff8526
Author: Mikle Kolyada <zlogene <AT> gentoo <DOT> org>
AuthorDate: Sat May 9 11:22:29 2020 +0000
Commit: Mikle Kolyada <zlogene <AT> gentoo <DOT> org>
CommitDate: Sat May 9 11:23:54 2020 +0000
URL: https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=86e8654f
dev-python/nltk-data: amd64 stable wrt bug #721496
Package-Manager: Portage-2.3.99, Repoman-2.3.22
RepoMan-Options: --include-arches="amd64"
Signed-off-by: Mikle Kolyada <zlogene <AT> gentoo.org>
dev-python/nltk-data/nltk-data-20200312-r1.ebuild | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/dev-python/nltk-data/nltk-data-20200312-r1.ebuild b/dev-python/nltk-data/nltk-data-20200312-r1.ebuild
index c8f12c580e6..d4effd97378 100644
--- a/dev-python/nltk-data/nltk-data-20200312-r1.ebuild
+++ b/dev-python/nltk-data/nltk-data-20200312-r1.ebuild
@@ -12,7 +12,7 @@ HOMEPAGE="https://www.nltk.org/nltk_data/"
# TODO: create a USE flag for free-ish subset
LICENSE="all-rights-reserved"
SLOT="0"
-KEYWORDS="~amd64 ~x86"
+KEYWORDS="amd64 ~x86"
IUSE="extra"
RESTRICT="bindist mirror"
* [gentoo-commits] repo/gentoo:master commit in: dev-python/nltk-data/
@ 2020-05-09 11:24 Mikle Kolyada
From: Mikle Kolyada @ 2020-05-09 11:24 UTC
To: gentoo-commits
commit: b10735d1b244609bb0dda3e3d8e5a92162c82b99
Author: Mikle Kolyada <zlogene <AT> gentoo <DOT> org>
AuthorDate: Sat May 9 11:23:16 2020 +0000
Commit: Mikle Kolyada <zlogene <AT> gentoo <DOT> org>
CommitDate: Sat May 9 11:23:57 2020 +0000
URL: https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=b10735d1
dev-python/nltk-data: x86 stable wrt bug #721496
Package-Manager: Portage-2.3.99, Repoman-2.3.22
RepoMan-Options: --include-arches="x86"
Signed-off-by: Mikle Kolyada <zlogene <AT> gentoo.org>
dev-python/nltk-data/nltk-data-20200312-r1.ebuild | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/dev-python/nltk-data/nltk-data-20200312-r1.ebuild b/dev-python/nltk-data/nltk-data-20200312-r1.ebuild
index d4effd97378..4a3d58c5db8 100644
--- a/dev-python/nltk-data/nltk-data-20200312-r1.ebuild
+++ b/dev-python/nltk-data/nltk-data-20200312-r1.ebuild
@@ -12,7 +12,7 @@ HOMEPAGE="https://www.nltk.org/nltk_data/"
# TODO: create a USE flag for free-ish subset
LICENSE="all-rights-reserved"
SLOT="0"
-KEYWORDS="amd64 ~x86"
+KEYWORDS="amd64 x86"
IUSE="extra"
RESTRICT="bindist mirror"
* [gentoo-commits] repo/gentoo:master commit in: dev-python/nltk-data/
@ 2021-10-23 20:46 Michał Górny
From: Michał Górny @ 2021-10-23 20:46 UTC
To: gentoo-commits
commit: 8fc9ac325105fab5a44261fd4874d87738d31c43
Author: Michał Górny <mgorny <AT> gentoo <DOT> org>
AuthorDate: Sat Oct 23 20:37:15 2021 +0000
Commit: Michał Górny <mgorny <AT> gentoo <DOT> org>
CommitDate: Sat Oct 23 20:46:04 2021 +0000
URL: https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=8fc9ac32
dev-python/nltk-data: Update data files for 20211023
Closes: https://bugs.gentoo.org/819780
Signed-off-by: Michał Górny <mgorny <AT> gentoo.org>
dev-python/nltk-data/Manifest | 3 +-
...0200312-r1.ebuild => nltk-data-20211023.ebuild} | 61 ++++++++++++++--------
2 files changed, 42 insertions(+), 22 deletions(-)
diff --git a/dev-python/nltk-data/Manifest b/dev-python/nltk-data/Manifest
index fa415ee7f30..0a6f8624fba 100644
--- a/dev-python/nltk-data/Manifest
+++ b/dev-python/nltk-data/Manifest
@@ -81,7 +81,7 @@ DIST nltk-smultron-20200312.zip 166207 BLAKE2B d0c3e75dd108965e260d913e0c02137da
DIST nltk-snowball_data-20200312.zip 6785405 BLAKE2B 44c10439b142540ac7eece967efa1431fd8f45342f0a90875dacf29ad374fe4c7d30af11d42ba45e0f1ec1836d56b2ff684ee352c5e8536cfb5db5eb7632285b SHA512 6c8a9259d88f6f7f499867d83b731de99d7fa4e8827ecedf836f653fc1a810efa9f6c5c6e2720a9e6610bc00978956b6a119bd08b70e3e241c4e9faccddd81d8
DIST nltk-spanish_grammars-20200312.zip 4047 BLAKE2B d8a8dbb558850a6a60f1fe5ab0f617f3a0f3c64bc7d49980cf793d374c6679d1bd42afb7e61776737b5eec162f2520abf2ee3acc92ea9ee0f397c3089b3b5b28 SHA512 4513347156e9351c259c0e2448198d68354bbc95e0a54561c31a88f13f333ebcba3e294c820cb62036665f2904eb6a7137546cd580e361c0423c30a8aed950b2
DIST nltk-state_union-20200312.zip 808757 BLAKE2B 2d352af0ced736d3c11a821eaba0b035b3b5b6b0f20db3bd5d4ac2451f99daf68bffd3ad984bd404ecc4a1e67ca9281c529af2cc9e295a3a7330f36ee9640bc8 SHA512 a6fc83e6230e57ba66a7af62f0d2a5d44a14530ed1e0e914590b3f8b5bf939967c126a5e6f6899ba9134843893f65212e836d311109393c1200191a5c3163485
-DIST nltk-stopwords-20200312.zip 23047 BLAKE2B a0677cc0d4a3d54ee6e27eab8fb7635d6cd29265204896870e57457a54459f1d6cabc0c4e76e749397f5eee299eed0b524fcbc2033ea17d81cdb6cd98c5ed968 SHA512 31774fd3db2e0fba0209db71c08f9b2d971311ea4a59739cfdc0a9ae34f6c6c593f2a991a14ee20b0de8b380215e609f8398ed50c546775322ab8c4c3f8d06dd
+DIST nltk-stopwords-20211023.zip 26220 BLAKE2B 50219508c5fb24c1392064ea0546ca9060829f51689c0d626bb99e3fa8f712df98ed475fc0d27d99f934fb156ce65df91a8e7c22e1b4f16833339fb0aea34a9e SHA512 8308623953560281288b64e695638ca3fa28e1b6201b538d01650d6dfd08e821687217c8d012e93adfa2a48afebdda11af1bb86d638358c2931f36754d1e15ef
DIST nltk-subjectivity-20200312.zip 521628 BLAKE2B 0a8777a5b91b1b825fbde41cc927d496480129f0a810349bddde2036ed16f37611f2bc3b007e74fe36523612529a92433d32d094be72d247f5faef8220c3c491 SHA512 a3cc4d2d20f26c5eabd86fbced2c013e69d46e607013278eb35831a62e57523a17aec1b580ba62c7a867e61a561e1b222d8430f0c1e2d429a9479e12b008c5b7
DIST nltk-swadesh-20200312.zip 22828 BLAKE2B 1cd9d5355c6b53694ec545ca001b0807b4912a7878ba075b0f81ce8b9b22a5c7a18cf52cf2449483a1c89cd20d8d86986dc3d827fa93a7ef5824fddbc0922025 SHA512 90cb32532a5378d05ce34b84b5f8363dbb32f24afac58b0dcc5cdccba98fd7d37def7f4fbe76b11c8c64059bd19df745562bfbf5f4c721d65ce9f4be1348ab76
DIST nltk-switchboard-20200312.zip 791161 BLAKE2B 211116a751ae246fa31b6aca96b396d3642d89ad112588a09f8d91a5b76dc41c7fb4d36c16c6358cd8e0da8056bc83598ed0dc635cf7b1fd8469a0e80b5f1761 SHA512 690e5392dc082c4ac550bde2848aa65117e7a25cbc4bce0887581c531d03be64e21f044ac0a3286648255f0edd7766b1161f5575ad5fd680c7303b34c3226b8f
@@ -102,6 +102,7 @@ DIST nltk-webtext-20200312.zip 646297 BLAKE2B ca072fc38c144b659c76c36c9161641c91
DIST nltk-wmt15_eval-20200312.zip 383096 BLAKE2B 119943db4240171077569b3302c678644c2c9547ba67bfd055751059e0a3ad3ab6a19e4eedf9108d313d46dcd36cf19e11d973981da8c70a01c4cb790a7bd739 SHA512 362395d4c77ebe92f4c19fa8c2000082dc7a2343acc19cccb596ca00db6c40c231b904d807f46e2691cc4c4a0c79d14873b2a1983a494f2ca1485d540d787ceb
DIST nltk-word2vec_sample-20200312.zip 49396025 BLAKE2B 0512b9bb7121a528190079f578e82b4e8f8021bfc0062cfa5613d260f3eee17460aecfeaacb65d950e79d27a653c78633a88c3638ec16377e2dbc3006387ebaa SHA512 09c30a4ab8f9fb6a5b36974b5953260d01cb4f285827fb90a374d054ad775ac978602ab56c452f46f4f8601312e232fd739d2f54dafa44ab8b7b01831cf0d9d1
DIST nltk-wordnet-20200312.zip 10775600 BLAKE2B dac56a8fb1fa6882b1871c394ad2acb2d3be739c424570e27c89fb6983df5f896a8f359092ba82752ddfc0531d83563a219e85f80124202f29bda93181efe4dc SHA512 1923a8bcd56fa0b9a9de91f53070dce28c3a7efbab11d2ef55c87134b1bf30de0f40abab59c39eb15dce54aec9491d8a5a259de212ff4cb25cde0ad09317009a
+DIST nltk-wordnet31-20211023.zip 11055271 BLAKE2B e41a1951af5a71c9506d1e948b860574c94ab0ef31c1789a7e7bfb29c6dccea5b1d8895007631f7b595e9f90306365b5042e7a80dc6e1364fdbf4a5f0cba3b28 SHA512 a86091bd55e3a706892550b232be8f5199092623f1f8305d8c9be967a8527fe7d4ecb6250c369b229fdf52b6f3008106b758adc355fa2ad08b5b0cf2a458c173
DIST nltk-wordnet_ic-20200312.zip 12056682 BLAKE2B c2dc2a646015b23699a72f636b588ec5718c70e6941d9d56863257e1e0396c8cf59ac1dc6ed74e5d7f0c2ee9129d63221a03967bf66a3d335e99160f295ed44d SHA512 1c94451a13af6c76bff60a0cab2e70402a3d9abd2e8fc62a5473f24ab4229feb0afe4faa8d389734697a6cf86d2c8b1dc700bb3afa3cbc279b75d7e0ec19fc6d
DIST nltk-words-20200312.zip 757777 BLAKE2B eaaaaab6c26e206e9b6ce45daf779e3cc6706a06132afeabf013026d0009caee2d678f3c4ea9125b9654f7143bef29ec7a5706b79e5650ea556c6821b7754e6f SHA512 2810f05d3fc7ee6b6f8636fa1ff7b4e8c8cdac12b415cc54d15c69102290122ea138ec4fa36cb483f790c1ac10b0f83ae4c2c3e0e8df7e67e90e962ee5dbb0be
DIST nltk-ycoe-20200312.zip 477 BLAKE2B 574835aa011a06a06363e26facd6a6f583a1dc1cac2de39adff59d8ab48eefac030b43d935a2f79af855259f2a9a571193dae2811589483af97406ff05c76c9e SHA512 e39ce165074d10ff63cb84ea52905d7ecb937797c8123ed113c5609afe1f63ac44d04d48a681002c4eac21dc9076ac74164b886c6f9ce42f3a102c38d1e8e756
diff --git a/dev-python/nltk-data/nltk-data-20200312-r1.ebuild b/dev-python/nltk-data/nltk-data-20211023.ebuild
similarity index 80%
rename from dev-python/nltk-data/nltk-data-20200312-r1.ebuild
rename to dev-python/nltk-data/nltk-data-20211023.ebuild
index 4a3d58c5db8..df8437c785b 100644
--- a/dev-python/nltk-data/nltk-data-20200312-r1.ebuild
+++ b/dev-python/nltk-data/nltk-data-20211023.ebuild
@@ -1,4 +1,4 @@
-# Copyright 2020 Gentoo Authors
+# Copyright 2020-2021 Gentoo Authors
# Distributed under the terms of the GNU General Public License v2
EAPI=7
@@ -18,7 +18,7 @@ RESTRICT="bindist mirror"
BDEPEND="app-arch/unzip"
-PACKAGES_ZIP=(
+PACKAGES_ZIP_2020=(
# wget -O - https://www.nltk.org/nltk_data/ | xml sel -t -m '//package[@unzip=0]' -v @subdir -o "/" -v @id -n - | sort
corpora/comtrans
corpora/conll2007
@@ -36,7 +36,7 @@ PACKAGES_ZIP=(
stemmers/snowball_data
)
-PACKAGES_UNPACK=(
+PACKAGES_UNPACK_2020=(
# wget -O - https://www.nltk.org/nltk_data/ | xml sel -t -m '//package[@unzip=1]' -v @subdir -o "/" -v @id -n - | sort
corpora/abc
corpora/alpino
@@ -85,7 +85,6 @@ PACKAGES_UNPACK=(
corpora/shakespeare
corpora/sinica_treebank
corpora/state_union
- corpora/stopwords
corpora/subjectivity
corpora/swadesh
corpora/switchboard
@@ -116,7 +115,12 @@ PACKAGES_UNPACK=(
tokenizers/punkt
)
-PACKAGES_UNPACK_EXTRA=(
+PACKAGES_UNPACK_2021=(
+ corpora/stopwords
+ corpora/wordnet31
+)
+
+PACKAGES_UNPACK_EXTRA_2020=(
chunkers/maxent_ne_chunker
corpora/biocreative_ppi
corpora/brown_tei
@@ -137,48 +141,63 @@ PACKAGES_UNPACK_EXTRA=(
)
add_data() {
- local x
+ local x version=${1}
+ shift
+
for x; do
SRC_URI+="
https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages/${x}.zip
- -> nltk-${x#*/}-${PV}.zip"
+ -> nltk-${x#*/}-${version}.zip"
done
}
-add_data "${PACKAGES_ZIP[@]}" "${PACKAGES_UNPACK[@]}"
+add_data 20200312 "${PACKAGES_ZIP_2020[@]}" "${PACKAGES_UNPACK_2020[@]}"
+add_data 20211023 "${PACKAGES_UNPACK_2021[@]}"
SRC_URI+="
extra? ("
-add_data "${PACKAGES_UNPACK_EXTRA[@]}"
+add_data 20200312 "${PACKAGES_UNPACK_EXTRA_2020[@]}"
SRC_URI+="
)"
CHECKREQS_DISK_USR=3G
CHECKREQS_DISK_BUILD=${CHECKREQS_DISK_USR}
-src_unpack() {
- local x
- local to_unpack=( "${PACKAGES_UNPACK[@]}" )
- use extra && to_unpack+=( "${PACKAGES_UNPACK_EXTRA[@]}" )
- for x in "${to_unpack[@]}"; do
+unpack_data() {
+ local x version=${1}
+ shift
+
+ for x; do
local cat=${x%/*}
local pkg=${x#*/}
mkdir -p "${S}/${cat}" || die
cd "${S}/${cat}" || die
- unpack "nltk-${pkg}-${PV}.zip"
+ unpack "nltk-${pkg}-${version}.zip"
done
}
-src_install() {
- dodir /usr/share/nltk_data
- mv * "${ED}/usr/share/nltk_data/" || die
+src_unpack() {
+ unpack_data 20200312 "${PACKAGES_UNPACK_2020[@]}"
+ unpack_data 20211023 "${PACKAGES_UNPACK_2021[@]}"
+ use extra && unpack_data 20200312 "${PACKAGES_UNPACK_EXTRA_2020[@]}"
+}
+
+install_zips() {
+ local x version=${1}
+ shift
- local x
- for x in "${PACKAGES_ZIP[@]}"; do
+ for x; do
local cat=${x%/*}
local pkg=${x#*/}
insinto "/usr/share/nltk_data/${cat}"
- newins "${DISTDIR}/nltk-${pkg}-${PV}.zip" "${pkg}.zip"
+ newins "${DISTDIR}/nltk-${pkg}-${version}.zip" "${pkg}.zip"
done
}
+
+src_install() {
+ dodir /usr/share/nltk_data
+ mv * "${ED}/usr/share/nltk_data/" || die
+
+ install_zips 20200312 "${PACKAGES_ZIP_2020[@]}"
+}
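The helpers introduced in this change all take the snapshot date as their first argument, shift it away, and loop over the remaining entries. Below is a reduced sketch of that pattern. list_distfiles is a hypothetical name; it only prints the distfile names that the real helpers would reference.

```shell
# Illustrative sketch only: list_distfiles is a hypothetical reduction of the
# versioned helpers (add_data, unpack_data, install_zips).
list_distfiles() {
	local x version="${1}"
	shift   # drop the version so "$@" holds only category/package entries

	for x; do
		printf 'nltk-%s-%s.zip\n' "${x#*/}" "${version}"
	done
}

# Entries keep the date of the snapshot they were last fetched from.
list_distfiles 20200312 corpora/wordnet corpora/words
list_distfiles 20211023 corpora/stopwords corpora/wordnet31
```

Keeping the date per array means that only entries whose upstream files changed need new Manifest lines. The remaining distfiles keep their 20200312 names.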
* [gentoo-commits] repo/gentoo:master commit in: dev-python/nltk-data/
@ 2021-12-21 9:42 Michał Górny
From: Michał Górny @ 2021-12-21 9:42 UTC
To: gentoo-commits
commit: bc7f4eb883bfc42ec54af84e582aa186f49bac0e
Author: Michał Górny <mgorny <AT> gentoo <DOT> org>
AuthorDate: Tue Dec 21 07:19:06 2021 +0000
Commit: Michał Górny <mgorny <AT> gentoo <DOT> org>
CommitDate: Tue Dec 21 09:42:47 2021 +0000
URL: https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=bc7f4eb8
dev-python/nltk-data: Add new data (20211221)
This one goes straight to stable since it's mirror-restricted
and upstream replaced the files in place again!
Signed-off-by: Michał Górny <mgorny <AT> gentoo.org>
dev-python/nltk-data/Manifest | 5 +
dev-python/nltk-data/nltk-data-20211221.ebuild | 210 +++++++++++++++++++++++++
2 files changed, 215 insertions(+)
diff --git a/dev-python/nltk-data/Manifest b/dev-python/nltk-data/Manifest
index 0a6f8624fbaa..2a75eb265207 100644
--- a/dev-python/nltk-data/Manifest
+++ b/dev-python/nltk-data/Manifest
@@ -30,6 +30,7 @@ DIST nltk-genesis-20200312.zip 473239 BLAKE2B 5234c2bcc31ae738bbd6cdfe1ee231a75c
DIST nltk-gutenberg-20200312.zip 4251829 BLAKE2B 54448e919010f12faa92c0ffd4adb31a7d1d76c262d5ac99c10129d188830107621f90e4ba611ffc1541c25037fc916b451169a8ae1df276747890fea2b5e06f SHA512 7b4442d18b89f9bb8261f23ad2f4d513b470461255633cbebe2b0c310f003effe819f7a86b39a62a6cde1b3fdbf099dabbdaaed520268c99fd1bb933baa97ead
DIST nltk-ieer-20200312.zip 166156 BLAKE2B 4c63be6f0f16e59584b7f97c21da0cf60a1efd3678ce662d683d8a6a9d05d333690b26ebb037d3047b26e3b51163ffa666f048ad8a89ee6c851ddd3edd2faf58 SHA512 b55545f0ac03282fb85c4a86bb69fd9988db8cc504114bff7b763007e20f559efa61b8ae5b7955c251c1c2dd9d23246e23515b1cd5d8d28225c20ec236c05cd8
DIST nltk-inaugural-20200312.zip 329806 BLAKE2B 222e22742d93156b852ddd72ffccdd5be5c907f43973b0ceee8d8d383bc5387575d1f974b29fd6d59499ec86483f56270251e6a5e48d7840c577207849baed59 SHA512 096591769373019883b9a2f85d32bc86021aefb74deb26dc9005233ae7f317d587c247edc53f1a956b8e29911656419d9677c271fb59ded4ca4edf2a80ff86b7
+DIST nltk-inaugural-20211221.zip 346476 BLAKE2B 6f14827fcd8edf9300725c6ffcfa899f5878537aaefc40b165581af60866ca7e7b0ed316a2b8dd87ff858667ce7a5f4035090a1cc3a404afa64b28d34dcf8747 SHA512 5440bff001209e42ee86c2501d2cd628cf639734c194c11097a862f6759f09a6adef1da0702b77211774f84d5d9d4ee55e89085b39d8a911811def1bd07ac8e2
DIST nltk-indian-20200312.zip 199187 BLAKE2B 9f70f1f28bcb96d828ca2ed96d6f89cd8eca8391b64842c742e940aaf59f31e142a4919ab5a21bce954bbe2ed85add162ac67c9549758f9aa44b3e90d45d44a7 SHA512 08e9342567bcc607e9007aea8415af2cad1f8a289104b47a61bf569f1391629ebbe25f19aacf53e2b14725edfdf18076331f044e1c2e42d07c96d5c875ff8f9e
DIST nltk-jeita-20200312.zip 16531215 BLAKE2B 7cf98b389d5c4b41d9d613c14c5116b83c520cfe7418ef982a6acc1294d8ecdb8a5d6d396409c9f13ca83df999793619d5eb5483612e87874766afe565c68362 SHA512 491e6dc27d70ebfd139be4dcd179b02c383f38edbe4290aed85638ecd7c987259b025e4f6f3f1d5bfe2b7aa3eff7188109205417d0acb64759988150f2753129
DIST nltk-kimmo-20200312.zip 186958 BLAKE2B ca288f48c09a33b86449c780e5c8612c3521e22bfe70918629bf3144b75b5566a942c3cc1440a13c2d81e3a69c77928c336850cc9fac4c232deb77293bfbe4e6 SHA512 fe38d5298ffb6dd62223c2002278d9ddf2695f341a29477f44434644bc6766cfe2c73091067f9eb8aea201735eef5012f96ba267f2e6ea3aec41ac206a43cac2
@@ -49,6 +50,7 @@ DIST nltk-names-20200312.zip 21326 BLAKE2B 37389f44337724fac26eaad5a4c310ea244b4
DIST nltk-nombank.1.0-20200312.zip 6728397 BLAKE2B 4dfb61fb651817086ad20957d02a39c99dff83ec45dc1a01e1dac055e0d7d9ef8108ef8df5218ce57b2b0866b748f5f76663a58e08f5e1731272173888e714e9 SHA512 81bcf29898ba1205ccbc8486d14518e95b0189b97b172a54580d4cb4584f4ef32a5284fe0abed0b954e67a318de8f10de5df1a20e6c08976df09cd2c9c66cb32
DIST nltk-nonbreaking_prefixes-20200312.zip 25437 BLAKE2B 9f26efbcc429f2d36a5584014604d2ca9d4de3864aa6192a114e9e2cd55ff1df50d4321a8c40dbe96225fad6be51f7be66f8a4d58146e370a1d33a9305f7f95f SHA512 4cddfdacf1c7fe4e93a31333413a83ebea2ea7907f4e6fe99e1c6ff4aed8efc951f4399478cdb88e5d5dc01cfb734b6672f4007b2a17c4784896fecaf4a93cdf
DIST nltk-nps_chat-20200312.zip 301366 BLAKE2B eadf0d0ee54c982765a124321fe6de161bfbfbfb0385d5d7e171666945ef313b960ba7b820e4fed02885ffc213173e2a7c97822513ae6e81e68858af21d9f424 SHA512 a716f3aaba3aedbbdaeaff1c6f1ef607476aa989eaef1c2971b827215f3167952477b203f1543d0fd27e5594c3efa01349c6a0efc1ac423048808f18eae94643
+DIST nltk-omw-1.4-20211221.zip 26634772 BLAKE2B 4d5fe648987b9aaa589d612f439560287200fe947a80182b941b0660a97a739ddda00a472209a26715aa72c44f37cdb5dbce9df89cb7bfbcdb7f4fa2908bb96e SHA512 d25c591c894fedeaf8649ef26653c51b74282e8af43bd0a8867eb16f79a89c4b567328a7c83f7651ec9e41dc746cba87b0c8bdbbe736db7f18c62a56820a92e6
DIST nltk-omw-20200312.zip 12110409 BLAKE2B 5ae634fdd89fcd2be2dea1511b2c434755e16b92804ea6bb5e2139113c644885d142e6290ab1d00c418f9df5241ebf6b7e227da120dc95d1e907d8f62a3d0cb8 SHA512 05b1b014e55738c49b34ea0371b95bd1d1aadfb0cc57325ec110ffbbb1250690e3a4aab45e877eb5433de08c7afe3ffa161e29976a0e28db9803fb637b24c843
DIST nltk-opinion_lexicon-20200312.zip 24947 BLAKE2B 47f5a381647075f4bea0230939098a6e83b47860165595e72f076a5436e2aad78148606d86ec331b1cc323963cb2a7500669c6fc2aff2e889bbaec8fd8571722 SHA512 75b98afa24c4e0284b2858770d7655c5c399066f56c1b9aa03dd51d80f211a694856695f8192570f67f02f0827ab02c3f3b65feb5a56c04bebfae3b58ca56ba1
DIST nltk-panlex_swadesh-20200312.zip 2861668 BLAKE2B e9ee3b2296c8738319a86dfb39d34fc5bcd346e725e0ad0df74571106ef05b71126aff70914e914cff436674c0d70c71ca36121331ab0327932505a2bdb26be1 SHA512 d096be1c6ea54814e448f61048f05df9107c0ca5f75eb3051a009efd4643c94188dde03505b0a4ae29d301268ffc5153eeee7012aae18310f9288fd4ccdb2287
@@ -77,6 +79,7 @@ DIST nltk-sentence_polarity-20200312.zip 490256 BLAKE2B 56a1b4668875d5c73b220710
DIST nltk-sentiwordnet-20200312.zip 4686546 BLAKE2B 98a45d9a5be971bd9e76ad5c154359c1800aa96ab1e8e301215e435cae6d659cc0fb38604435da4e9a391741a72ef7fcb842a1cf119348a7039f717843035f12 SHA512 cd0ff31abec3b85e3114546918c9152360e3da45fbbaf7b4c76a13a6d288f1d9ea7a4633638080cd0205cc293b4123445106e9b5daddb2ae379d6a08d1c992a5
DIST nltk-shakespeare-20200312.zip 475458 BLAKE2B c60acbf2d90f5fffdae43bdf85191dbd694ec5ae19e37b145ad04af6d8481988f0c2ce644baff2e31707e86d95a9b3b03c1da4faaa46be92da2291cce565955e SHA512 72cbbdb237b78df1f6bf27b5504606f805433746b9862dd1ce544ab712783a41fb165f3a4f282a7f1bd58c7dfe6036c14ecb07a82c109f6d326eb95e470002fb
DIST nltk-sinica_treebank-20200312.zip 899237 BLAKE2B e074dbdc7f1fe5958109f8ea019bead946bd2e3cd1c0849cf3ad49eaa34a1989baee7935a70e9103c121e5d375954d1dc53e4a1599e394ce6c2fda4e98769645 SHA512 724fb0905cbd0f903c3a75d70aee8a3e6f29e62971c1501efd8bfb3f4c6619e5fd8a285648e7792ea8e86bbda0a2afc6eecae45ad9f361983bf83b756ab52960
+DIST nltk-sinica_treebank-20211221.zip 906706 BLAKE2B 0c00bc7d39765b86eb99f55e82522d390cf0270fd2e2688d5825481b6f11e54aaa71b3f9ac95af170be543353746777da130bb6b294b0529330c712f9edaaab8 SHA512 22d44e23e89691f775a0e54971fada692aecd7cfa5181d49e9a53d0a96dc3be1beaa1e1ec6840f9b2756c5c16c93f96b571f6b08c175701485afc0c32261cac8
DIST nltk-smultron-20200312.zip 166207 BLAKE2B d0c3e75dd108965e260d913e0c02137da8707a751a4f53a1f59ff6a26af9997e2f0193980b5087277b87b74d9aab3e3c8d066d7264311cec2a94f0f9d21caf44 SHA512 8872717994269caff4ccf3f52cef273b2ef007efeb8ab89392cb4db3b8e3ba4b8859ab2fd70762b8941bee4558aebba3021869772be17a64e1056968aa605e19
DIST nltk-snowball_data-20200312.zip 6785405 BLAKE2B 44c10439b142540ac7eece967efa1431fd8f45342f0a90875dacf29ad374fe4c7d30af11d42ba45e0f1ec1836d56b2ff684ee352c5e8536cfb5db5eb7632285b SHA512 6c8a9259d88f6f7f499867d83b731de99d7fa4e8827ecedf836f653fc1a810efa9f6c5c6e2720a9e6610bc00978956b6a119bd08b70e3e241c4e9faccddd81d8
DIST nltk-spanish_grammars-20200312.zip 4047 BLAKE2B d8a8dbb558850a6a60f1fe5ab0f617f3a0f3c64bc7d49980cf793d374c6679d1bd42afb7e61776737b5eec162f2520abf2ee3acc92ea9ee0f397c3089b3b5b28 SHA512 4513347156e9351c259c0e2448198d68354bbc95e0a54561c31a88f13f333ebcba3e294c820cb62036665f2904eb6a7137546cd580e361c0423c30a8aed950b2
@@ -102,7 +105,9 @@ DIST nltk-webtext-20200312.zip 646297 BLAKE2B ca072fc38c144b659c76c36c9161641c91
DIST nltk-wmt15_eval-20200312.zip 383096 BLAKE2B 119943db4240171077569b3302c678644c2c9547ba67bfd055751059e0a3ad3ab6a19e4eedf9108d313d46dcd36cf19e11d973981da8c70a01c4cb790a7bd739 SHA512 362395d4c77ebe92f4c19fa8c2000082dc7a2343acc19cccb596ca00db6c40c231b904d807f46e2691cc4c4a0c79d14873b2a1983a494f2ca1485d540d787ceb
DIST nltk-word2vec_sample-20200312.zip 49396025 BLAKE2B 0512b9bb7121a528190079f578e82b4e8f8021bfc0062cfa5613d260f3eee17460aecfeaacb65d950e79d27a653c78633a88c3638ec16377e2dbc3006387ebaa SHA512 09c30a4ab8f9fb6a5b36974b5953260d01cb4f285827fb90a374d054ad775ac978602ab56c452f46f4f8601312e232fd739d2f54dafa44ab8b7b01831cf0d9d1
DIST nltk-wordnet-20200312.zip 10775600 BLAKE2B dac56a8fb1fa6882b1871c394ad2acb2d3be739c424570e27c89fb6983df5f896a8f359092ba82752ddfc0531d83563a219e85f80124202f29bda93181efe4dc SHA512 1923a8bcd56fa0b9a9de91f53070dce28c3a7efbab11d2ef55c87134b1bf30de0f40abab59c39eb15dce54aec9491d8a5a259de212ff4cb25cde0ad09317009a
+DIST nltk-wordnet2021-20211221.zip 11332750 BLAKE2B f4296728b803ef5a48cf312e2dd1471e8a240a0486e0c45b13c4c9998b8577ac30f87d151ffb2957e319e2b7b82706eba82b7b21530bfb3600d7cff2f4aefa0b SHA512 f27e847e0fd3fda83848af8bf94793d0e89c5f89b7868e222ba60be59122587f524e124e9d84f0b904b3a48808aae1c7d849f5851e2f3119ec99b3c30baac400
DIST nltk-wordnet31-20211023.zip 11055271 BLAKE2B e41a1951af5a71c9506d1e948b860574c94ab0ef31c1789a7e7bfb29c6dccea5b1d8895007631f7b595e9f90306365b5042e7a80dc6e1364fdbf4a5f0cba3b28 SHA512 a86091bd55e3a706892550b232be8f5199092623f1f8305d8c9be967a8527fe7d4ecb6250c369b229fdf52b6f3008106b758adc355fa2ad08b5b0cf2a458c173
+DIST nltk-wordnet31-20211221.zip 11058667 BLAKE2B fb2dca62f602e05226dd62802f42e5f74d11cc535e71b1dc634a202a8c52a7912b1eb0b71edb3c564520c7c05251d9a9928abe88398b816a43035a6db4bda1b3 SHA512 dffea47ded52a042594dced94bc877686d78e6dc9d5a9db5f2a1718190f411cc991b04028d0a4407a8354afd695258e2a5ae881d5f6d9eaf0c3d30de1a8cbf96
DIST nltk-wordnet_ic-20200312.zip 12056682 BLAKE2B c2dc2a646015b23699a72f636b588ec5718c70e6941d9d56863257e1e0396c8cf59ac1dc6ed74e5d7f0c2ee9129d63221a03967bf66a3d335e99160f295ed44d SHA512 1c94451a13af6c76bff60a0cab2e70402a3d9abd2e8fc62a5473f24ab4229feb0afe4faa8d389734697a6cf86d2c8b1dc700bb3afa3cbc279b75d7e0ec19fc6d
DIST nltk-words-20200312.zip 757777 BLAKE2B eaaaaab6c26e206e9b6ce45daf779e3cc6706a06132afeabf013026d0009caee2d678f3c4ea9125b9654f7143bef29ec7a5706b79e5650ea556c6821b7754e6f SHA512 2810f05d3fc7ee6b6f8636fa1ff7b4e8c8cdac12b415cc54d15c69102290122ea138ec4fa36cb483f790c1ac10b0f83ae4c2c3e0e8df7e67e90e962ee5dbb0be
DIST nltk-ycoe-20200312.zip 477 BLAKE2B 574835aa011a06a06363e26facd6a6f583a1dc1cac2de39adff59d8ab48eefac030b43d935a2f79af855259f2a9a571193dae2811589483af97406ff05c76c9e SHA512 e39ce165074d10ff63cb84ea52905d7ecb937797c8123ed113c5609afe1f63ac44d04d48a681002c4eac21dc9076ac74164b886c6f9ce42f3a102c38d1e8e756
diff --git a/dev-python/nltk-data/nltk-data-20211221.ebuild b/dev-python/nltk-data/nltk-data-20211221.ebuild
new file mode 100644
index 000000000000..104917b4e5a0
--- /dev/null
+++ b/dev-python/nltk-data/nltk-data-20211221.ebuild
@@ -0,0 +1,210 @@
+# Copyright 2020-2021 Gentoo Authors
+# Distributed under the terms of the GNU General Public License v2
+
+EAPI=8
+
+inherit check-reqs
+
+DESCRIPTION="Data files for NLTK"
+HOMEPAGE="https://www.nltk.org/nltk_data/"
+
+# at least some of the files have poorly documented licenses
+# TODO: create a USE flag for free-ish subset
+LICENSE="all-rights-reserved"
+SLOT="0"
+KEYWORDS="amd64 x86"
+IUSE="extra"
+RESTRICT="bindist mirror"
+
+BDEPEND="app-arch/unzip"
+
+PACKAGES_ZIP_2020=(
+ # wget -O - https://www.nltk.org/nltk_data/ | xml sel -t -m '//package[@unzip=0]' -v @subdir -o "/" -v @id -n - | sort
+ corpora/comtrans
+ corpora/conll2007
+ corpora/jeita
+ corpora/knbc
+ corpora/machado
+ corpora/masc_tagged
+ corpora/nombank.1.0
+ corpora/panlex_swadesh
+ corpora/propbank
+ corpora/reuters
+ corpora/semcor
+ corpora/universal_treebanks_v20
+ sentiment/vader_lexicon
+ stemmers/snowball_data
+)
+
+PACKAGES_UNPACK_2020=(
+ # wget -O - https://www.nltk.org/nltk_data/ | xml sel -t -m '//package[@unzip=1]' -v @subdir -o "/" -v @id -n - | sort
+ corpora/abc
+ corpora/alpino
+ corpora/brown
+ corpora/cess_cat
+ corpora/cess_esp
+ corpora/chat80
+ corpora/city_database
+ corpora/cmudict
+ corpora/comparative_sentences
+ corpora/conll2000
+ corpora/conll2002
+ corpora/crubadan
+ corpora/dependency_treebank
+ corpora/dolch
+ corpora/europarl_raw
+ corpora/floresta
+ corpora/framenet_v15
+ corpora/framenet_v17
+ corpora/gazetteers
+ corpora/genesis
+ corpora/gutenberg
+ corpora/ieer
+ corpora/indian
+ corpora/lin_thesaurus
+ corpora/mac_morpho
+ corpora/movie_reviews
+ corpora/mte_teip5
+ corpora/names
+ corpora/nonbreaking_prefixes
+ corpora/nps_chat
+ corpora/omw
+ corpora/opinion_lexicon
+ corpora/pl196x
+ corpora/ppattach
+ corpora/product_reviews_1
+ corpora/product_reviews_2
+ corpora/pros_cons
+ corpora/ptb
+ corpora/qc
+ corpora/rte
+ corpora/senseval
+ corpora/sentence_polarity
+ corpora/sentiwordnet
+ corpora/shakespeare
+ corpora/state_union
+ corpora/subjectivity
+ corpora/swadesh
+ corpora/switchboard
+ corpora/timit
+ corpora/toolbox
+ corpora/treebank
+ corpora/twitter_samples
+ corpora/udhr
+ corpora/udhr2
+ corpora/verbnet
+ corpora/webtext
+ corpora/wordnet
+ corpora/wordnet_ic
+ corpora/words
+ grammars/book_grammars
+ grammars/large_grammars
+ grammars/sample_grammars
+ misc/perluniprops
+ models/bllip_wsj_no_aux
+ models/moses_sample
+ models/wmt15_eval
+ models/word2vec_sample
+ stemmers/porter_test
+ stemmers/rslp
+ taggers/averaged_perceptron_tagger
+ taggers/averaged_perceptron_tagger_ru
+ taggers/universal_tagset
+ tokenizers/punkt
+)
+
+PACKAGES_UNPACK_2021=(
+ corpora/stopwords
+)
+
+PACKAGES_UNPACK_2021_12=(
+ corpora/inaugural
+ corpora/omw-1.4
+ corpora/sinica_treebank
+ corpora/wordnet2021
+ corpora/wordnet31
+)
+
+PACKAGES_UNPACK_EXTRA_2020=(
+ chunkers/maxent_ne_chunker
+ corpora/biocreative_ppi
+ corpora/brown_tei
+ corpora/kimmo
+ corpora/paradigms
+ corpora/pe08
+ corpora/pil
+ corpora/problem_reports
+ corpora/smultron
+ corpora/unicode_samples
+ corpora/verbnet3
+ corpora/ycoe
+ grammars/basque_grammars
+ grammars/spanish_grammars
+ help/tagsets
+ misc/mwa_ppdb
+ taggers/maxent_treebank_pos_tagger
+)
+
+add_data() {
+ local x version=${1}
+ shift
+
+ for x; do
+ SRC_URI+="
+ https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages/${x}.zip
+ -> nltk-${x#*/}-${version}.zip"
+ done
+}
+
+add_data 20200312 "${PACKAGES_ZIP_2020[@]}" "${PACKAGES_UNPACK_2020[@]}"
+add_data 20211023 "${PACKAGES_UNPACK_2021[@]}"
+add_data 20211221 "${PACKAGES_UNPACK_2021_12[@]}"
+SRC_URI+="
+ extra? ("
+add_data 20200312 "${PACKAGES_UNPACK_EXTRA_2020[@]}"
+SRC_URI+="
+ )"
+
+CHECKREQS_DISK_USR=3G
+CHECKREQS_DISK_BUILD=${CHECKREQS_DISK_USR}
+
+unpack_data() {
+ local x version=${1}
+ shift
+
+ for x; do
+ local cat=${x%/*}
+ local pkg=${x#*/}
+
+ mkdir -p "${S}/${cat}" || die
+ cd "${S}/${cat}" || die
+ unpack "nltk-${pkg}-${version}.zip"
+ done
+}
+
+src_unpack() {
+ unpack_data 20200312 "${PACKAGES_UNPACK_2020[@]}"
+ unpack_data 20211023 "${PACKAGES_UNPACK_2021[@]}"
+ unpack_data 20211221 "${PACKAGES_UNPACK_2021_12[@]}"
+ use extra && unpack_data 20200312 "${PACKAGES_UNPACK_EXTRA_2020[@]}"
+}
+
+install_zips() {
+ local x version=${1}
+ shift
+
+ for x; do
+ local cat=${x%/*}
+ local pkg=${x#*/}
+
+ insinto "/usr/share/nltk_data/${cat}"
+ newins "${DISTDIR}/nltk-${pkg}-${version}.zip" "${pkg}.zip"
+ done
+}
+
+src_install() {
+ dodir /usr/share/nltk_data
+ mv * "${ED}/usr/share/nltk_data/" || die
+
+ install_zips 20200312 "${PACKAGES_ZIP_2020[@]}"
+}
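
Note on the naming scheme used above: since upstream replaces files in place, the ebuild renames each download to a date-stamped distfile and derives the category and package name from each "category/package" entry with shell parameter expansion. The following is a minimal sketch of that derivation only. The helper names distfile_name and category_of are hypothetical and are not part of the ebuild; the expansions themselves are the ones used in add_data and unpack_data.

```shell
# Hypothetical helper (not in the ebuild): print the distfile name that
# add_data assigns to a "category/package" entry for a given date version.
distfile_name() {
	local version="${1}" x="${2}"
	# ${x#*/} removes the shortest prefix ending in "/", leaving the package name
	printf 'nltk-%s-%s.zip\n' "${x#*/}" "${version}"
}

# Hypothetical helper (not in the ebuild): print the category directory
# that unpack_data and install_zips derive from the same entry.
category_of() {
	local x="${1}"
	# ${x%/*} removes the shortest suffix starting with "/", leaving the category
	printf '%s\n' "${x%/*}"
}

distfile_name 20211221 corpora/omw-1.4
category_of corpora/omw-1.4
```

The first call prints the same file name that appears in the Manifest hunk of this commit, and the second prints the directory the archive is unpacked into under ${S}.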
* [gentoo-commits] repo/gentoo:master commit in: dev-python/nltk-data/
@ 2021-12-21 9:42 Michał Górny
From: Michał Górny @ 2021-12-21 9:42 UTC
To: gentoo-commits
commit: 08f79ab2f283b49343dd5a4f263eeb5719b1d636
Author: Michał Górny <mgorny <AT> gentoo <DOT> org>
AuthorDate: Tue Dec 21 07:27:25 2021 +0000
Commit: Michał Górny <mgorny <AT> gentoo <DOT> org>
CommitDate: Tue Dec 21 09:42:49 2021 +0000
URL: https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=08f79ab2
dev-python/nltk-data: Remove old
Signed-off-by: Michał Górny <mgorny <AT> gentoo.org>
dev-python/nltk-data/Manifest | 3 -
dev-python/nltk-data/nltk-data-20211023.ebuild | 203 -------------------------
2 files changed, 206 deletions(-)
diff --git a/dev-python/nltk-data/Manifest b/dev-python/nltk-data/Manifest
index 2a75eb265207..3a61ebbf461e 100644
--- a/dev-python/nltk-data/Manifest
+++ b/dev-python/nltk-data/Manifest
@@ -29,7 +29,6 @@ DIST nltk-gazetteers-20200312.zip 8265 BLAKE2B aa4f8db99a7ae8c26911d33abb67713bc
DIST nltk-genesis-20200312.zip 473239 BLAKE2B 5234c2bcc31ae738bbd6cdfe1ee231a75ceacc74334e3b03fcefb409427406aee1700e0d8ca8fad1444736005b53fdb0fbdb9068085837f4bf7292cf61351b8b SHA512 e74f6a8f304bca292bf8e138a739a97633075cf9374c95f43ec75473cd7bbeb0921fe24f7864659e640e694d227e61debd67bc41150bd898b7241a274e28196f
DIST nltk-gutenberg-20200312.zip 4251829 BLAKE2B 54448e919010f12faa92c0ffd4adb31a7d1d76c262d5ac99c10129d188830107621f90e4ba611ffc1541c25037fc916b451169a8ae1df276747890fea2b5e06f SHA512 7b4442d18b89f9bb8261f23ad2f4d513b470461255633cbebe2b0c310f003effe819f7a86b39a62a6cde1b3fdbf099dabbdaaed520268c99fd1bb933baa97ead
DIST nltk-ieer-20200312.zip 166156 BLAKE2B 4c63be6f0f16e59584b7f97c21da0cf60a1efd3678ce662d683d8a6a9d05d333690b26ebb037d3047b26e3b51163ffa666f048ad8a89ee6c851ddd3edd2faf58 SHA512 b55545f0ac03282fb85c4a86bb69fd9988db8cc504114bff7b763007e20f559efa61b8ae5b7955c251c1c2dd9d23246e23515b1cd5d8d28225c20ec236c05cd8
-DIST nltk-inaugural-20200312.zip 329806 BLAKE2B 222e22742d93156b852ddd72ffccdd5be5c907f43973b0ceee8d8d383bc5387575d1f974b29fd6d59499ec86483f56270251e6a5e48d7840c577207849baed59 SHA512 096591769373019883b9a2f85d32bc86021aefb74deb26dc9005233ae7f317d587c247edc53f1a956b8e29911656419d9677c271fb59ded4ca4edf2a80ff86b7
DIST nltk-inaugural-20211221.zip 346476 BLAKE2B 6f14827fcd8edf9300725c6ffcfa899f5878537aaefc40b165581af60866ca7e7b0ed316a2b8dd87ff858667ce7a5f4035090a1cc3a404afa64b28d34dcf8747 SHA512 5440bff001209e42ee86c2501d2cd628cf639734c194c11097a862f6759f09a6adef1da0702b77211774f84d5d9d4ee55e89085b39d8a911811def1bd07ac8e2
DIST nltk-indian-20200312.zip 199187 BLAKE2B 9f70f1f28bcb96d828ca2ed96d6f89cd8eca8391b64842c742e940aaf59f31e142a4919ab5a21bce954bbe2ed85add162ac67c9549758f9aa44b3e90d45d44a7 SHA512 08e9342567bcc607e9007aea8415af2cad1f8a289104b47a61bf569f1391629ebbe25f19aacf53e2b14725edfdf18076331f044e1c2e42d07c96d5c875ff8f9e
DIST nltk-jeita-20200312.zip 16531215 BLAKE2B 7cf98b389d5c4b41d9d613c14c5116b83c520cfe7418ef982a6acc1294d8ecdb8a5d6d396409c9f13ca83df999793619d5eb5483612e87874766afe565c68362 SHA512 491e6dc27d70ebfd139be4dcd179b02c383f38edbe4290aed85638ecd7c987259b025e4f6f3f1d5bfe2b7aa3eff7188109205417d0acb64759988150f2753129
@@ -78,7 +77,6 @@ DIST nltk-senseval-20200312.zip 2151350 BLAKE2B ec21463f7ffe27e3dda47e38e3563464
DIST nltk-sentence_polarity-20200312.zip 490256 BLAKE2B 56a1b4668875d5c73b220710e8af84410a273f394d68d7e9decb47c5370f0f7c1982d37710ab903d78dac0108ec2e1a26e8f1cdd3fee172487397c8b2ae01ee7 SHA512 d3772edc13d46fb835f40a9c6714b95885d3ba2eded37a24c8aad6d3e63ca49a9c79387218e2f67bda2e52d52449daf1dd1e603a4819ae46da78f4e0a38ba599
DIST nltk-sentiwordnet-20200312.zip 4686546 BLAKE2B 98a45d9a5be971bd9e76ad5c154359c1800aa96ab1e8e301215e435cae6d659cc0fb38604435da4e9a391741a72ef7fcb842a1cf119348a7039f717843035f12 SHA512 cd0ff31abec3b85e3114546918c9152360e3da45fbbaf7b4c76a13a6d288f1d9ea7a4633638080cd0205cc293b4123445106e9b5daddb2ae379d6a08d1c992a5
DIST nltk-shakespeare-20200312.zip 475458 BLAKE2B c60acbf2d90f5fffdae43bdf85191dbd694ec5ae19e37b145ad04af6d8481988f0c2ce644baff2e31707e86d95a9b3b03c1da4faaa46be92da2291cce565955e SHA512 72cbbdb237b78df1f6bf27b5504606f805433746b9862dd1ce544ab712783a41fb165f3a4f282a7f1bd58c7dfe6036c14ecb07a82c109f6d326eb95e470002fb
-DIST nltk-sinica_treebank-20200312.zip 899237 BLAKE2B e074dbdc7f1fe5958109f8ea019bead946bd2e3cd1c0849cf3ad49eaa34a1989baee7935a70e9103c121e5d375954d1dc53e4a1599e394ce6c2fda4e98769645 SHA512 724fb0905cbd0f903c3a75d70aee8a3e6f29e62971c1501efd8bfb3f4c6619e5fd8a285648e7792ea8e86bbda0a2afc6eecae45ad9f361983bf83b756ab52960
DIST nltk-sinica_treebank-20211221.zip 906706 BLAKE2B 0c00bc7d39765b86eb99f55e82522d390cf0270fd2e2688d5825481b6f11e54aaa71b3f9ac95af170be543353746777da130bb6b294b0529330c712f9edaaab8 SHA512 22d44e23e89691f775a0e54971fada692aecd7cfa5181d49e9a53d0a96dc3be1beaa1e1ec6840f9b2756c5c16c93f96b571f6b08c175701485afc0c32261cac8
DIST nltk-smultron-20200312.zip 166207 BLAKE2B d0c3e75dd108965e260d913e0c02137da8707a751a4f53a1f59ff6a26af9997e2f0193980b5087277b87b74d9aab3e3c8d066d7264311cec2a94f0f9d21caf44 SHA512 8872717994269caff4ccf3f52cef273b2ef007efeb8ab89392cb4db3b8e3ba4b8859ab2fd70762b8941bee4558aebba3021869772be17a64e1056968aa605e19
DIST nltk-snowball_data-20200312.zip 6785405 BLAKE2B 44c10439b142540ac7eece967efa1431fd8f45342f0a90875dacf29ad374fe4c7d30af11d42ba45e0f1ec1836d56b2ff684ee352c5e8536cfb5db5eb7632285b SHA512 6c8a9259d88f6f7f499867d83b731de99d7fa4e8827ecedf836f653fc1a810efa9f6c5c6e2720a9e6610bc00978956b6a119bd08b70e3e241c4e9faccddd81d8
@@ -106,7 +104,6 @@ DIST nltk-wmt15_eval-20200312.zip 383096 BLAKE2B 119943db4240171077569b3302c6786
DIST nltk-word2vec_sample-20200312.zip 49396025 BLAKE2B 0512b9bb7121a528190079f578e82b4e8f8021bfc0062cfa5613d260f3eee17460aecfeaacb65d950e79d27a653c78633a88c3638ec16377e2dbc3006387ebaa SHA512 09c30a4ab8f9fb6a5b36974b5953260d01cb4f285827fb90a374d054ad775ac978602ab56c452f46f4f8601312e232fd739d2f54dafa44ab8b7b01831cf0d9d1
DIST nltk-wordnet-20200312.zip 10775600 BLAKE2B dac56a8fb1fa6882b1871c394ad2acb2d3be739c424570e27c89fb6983df5f896a8f359092ba82752ddfc0531d83563a219e85f80124202f29bda93181efe4dc SHA512 1923a8bcd56fa0b9a9de91f53070dce28c3a7efbab11d2ef55c87134b1bf30de0f40abab59c39eb15dce54aec9491d8a5a259de212ff4cb25cde0ad09317009a
DIST nltk-wordnet2021-20211221.zip 11332750 BLAKE2B f4296728b803ef5a48cf312e2dd1471e8a240a0486e0c45b13c4c9998b8577ac30f87d151ffb2957e319e2b7b82706eba82b7b21530bfb3600d7cff2f4aefa0b SHA512 f27e847e0fd3fda83848af8bf94793d0e89c5f89b7868e222ba60be59122587f524e124e9d84f0b904b3a48808aae1c7d849f5851e2f3119ec99b3c30baac400
-DIST nltk-wordnet31-20211023.zip 11055271 BLAKE2B e41a1951af5a71c9506d1e948b860574c94ab0ef31c1789a7e7bfb29c6dccea5b1d8895007631f7b595e9f90306365b5042e7a80dc6e1364fdbf4a5f0cba3b28 SHA512 a86091bd55e3a706892550b232be8f5199092623f1f8305d8c9be967a8527fe7d4ecb6250c369b229fdf52b6f3008106b758adc355fa2ad08b5b0cf2a458c173
DIST nltk-wordnet31-20211221.zip 11058667 BLAKE2B fb2dca62f602e05226dd62802f42e5f74d11cc535e71b1dc634a202a8c52a7912b1eb0b71edb3c564520c7c05251d9a9928abe88398b816a43035a6db4bda1b3 SHA512 dffea47ded52a042594dced94bc877686d78e6dc9d5a9db5f2a1718190f411cc991b04028d0a4407a8354afd695258e2a5ae881d5f6d9eaf0c3d30de1a8cbf96
DIST nltk-wordnet_ic-20200312.zip 12056682 BLAKE2B c2dc2a646015b23699a72f636b588ec5718c70e6941d9d56863257e1e0396c8cf59ac1dc6ed74e5d7f0c2ee9129d63221a03967bf66a3d335e99160f295ed44d SHA512 1c94451a13af6c76bff60a0cab2e70402a3d9abd2e8fc62a5473f24ab4229feb0afe4faa8d389734697a6cf86d2c8b1dc700bb3afa3cbc279b75d7e0ec19fc6d
DIST nltk-words-20200312.zip 757777 BLAKE2B eaaaaab6c26e206e9b6ce45daf779e3cc6706a06132afeabf013026d0009caee2d678f3c4ea9125b9654f7143bef29ec7a5706b79e5650ea556c6821b7754e6f SHA512 2810f05d3fc7ee6b6f8636fa1ff7b4e8c8cdac12b415cc54d15c69102290122ea138ec4fa36cb483f790c1ac10b0f83ae4c2c3e0e8df7e67e90e962ee5dbb0be
diff --git a/dev-python/nltk-data/nltk-data-20211023.ebuild b/dev-python/nltk-data/nltk-data-20211023.ebuild
deleted file mode 100644
index df8437c785be..000000000000
--- a/dev-python/nltk-data/nltk-data-20211023.ebuild
+++ /dev/null
@@ -1,203 +0,0 @@
-# Copyright 2020-2021 Gentoo Authors
-# Distributed under the terms of the GNU General Public License v2
-
-EAPI=7
-
-inherit check-reqs
-
-DESCRIPTION="Data files for NLTK"
-HOMEPAGE="https://www.nltk.org/nltk_data/"
-
-# at least some of the files have poorly documented licenses
-# TODO: create a USE flag for free-ish subset
-LICENSE="all-rights-reserved"
-SLOT="0"
-KEYWORDS="amd64 x86"
-IUSE="extra"
-RESTRICT="bindist mirror"
-
-BDEPEND="app-arch/unzip"
-
-PACKAGES_ZIP_2020=(
- # wget -O - https://www.nltk.org/nltk_data/ | xml sel -t -m '//package[@unzip=0]' -v @subdir -o "/" -v @id -n - | sort
- corpora/comtrans
- corpora/conll2007
- corpora/jeita
- corpora/knbc
- corpora/machado
- corpora/masc_tagged
- corpora/nombank.1.0
- corpora/panlex_swadesh
- corpora/propbank
- corpora/reuters
- corpora/semcor
- corpora/universal_treebanks_v20
- sentiment/vader_lexicon
- stemmers/snowball_data
-)
-
-PACKAGES_UNPACK_2020=(
- # wget -O - https://www.nltk.org/nltk_data/ | xml sel -t -m '//package[@unzip=1]' -v @subdir -o "/" -v @id -n - | sort
- corpora/abc
- corpora/alpino
- corpora/brown
- corpora/cess_cat
- corpora/cess_esp
- corpora/chat80
- corpora/city_database
- corpora/cmudict
- corpora/comparative_sentences
- corpora/conll2000
- corpora/conll2002
- corpora/crubadan
- corpora/dependency_treebank
- corpora/dolch
- corpora/europarl_raw
- corpora/floresta
- corpora/framenet_v15
- corpora/framenet_v17
- corpora/gazetteers
- corpora/genesis
- corpora/gutenberg
- corpora/ieer
- corpora/inaugural
- corpora/indian
- corpora/lin_thesaurus
- corpora/mac_morpho
- corpora/movie_reviews
- corpora/mte_teip5
- corpora/names
- corpora/nonbreaking_prefixes
- corpora/nps_chat
- corpora/omw
- corpora/opinion_lexicon
- corpora/pl196x
- corpora/ppattach
- corpora/product_reviews_1
- corpora/product_reviews_2
- corpora/pros_cons
- corpora/ptb
- corpora/qc
- corpora/rte
- corpora/senseval
- corpora/sentence_polarity
- corpora/sentiwordnet
- corpora/shakespeare
- corpora/sinica_treebank
- corpora/state_union
- corpora/subjectivity
- corpora/swadesh
- corpora/switchboard
- corpora/timit
- corpora/toolbox
- corpora/treebank
- corpora/twitter_samples
- corpora/udhr
- corpora/udhr2
- corpora/verbnet
- corpora/webtext
- corpora/wordnet
- corpora/wordnet_ic
- corpora/words
- grammars/book_grammars
- grammars/large_grammars
- grammars/sample_grammars
- misc/perluniprops
- models/bllip_wsj_no_aux
- models/moses_sample
- models/wmt15_eval
- models/word2vec_sample
- stemmers/porter_test
- stemmers/rslp
- taggers/averaged_perceptron_tagger
- taggers/averaged_perceptron_tagger_ru
- taggers/universal_tagset
- tokenizers/punkt
-)
-
-PACKAGES_UNPACK_2021=(
- corpora/stopwords
- corpora/wordnet31
-)
-
-PACKAGES_UNPACK_EXTRA_2020=(
- chunkers/maxent_ne_chunker
- corpora/biocreative_ppi
- corpora/brown_tei
- corpora/kimmo
- corpora/paradigms
- corpora/pe08
- corpora/pil
- corpora/problem_reports
- corpora/smultron
- corpora/unicode_samples
- corpora/verbnet3
- corpora/ycoe
- grammars/basque_grammars
- grammars/spanish_grammars
- help/tagsets
- misc/mwa_ppdb
- taggers/maxent_treebank_pos_tagger
-)
-
-add_data() {
- local x version=${1}
- shift
-
- for x; do
- SRC_URI+="
- https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages/${x}.zip
- -> nltk-${x#*/}-${version}.zip"
- done
-}
-
-add_data 20200312 "${PACKAGES_ZIP_2020[@]}" "${PACKAGES_UNPACK_2020[@]}"
-add_data 20211023 "${PACKAGES_UNPACK_2021[@]}"
-SRC_URI+="
- extra? ("
-add_data 20200312 "${PACKAGES_UNPACK_EXTRA_2020[@]}"
-SRC_URI+="
- )"
-
-CHECKREQS_DISK_USR=3G
-CHECKREQS_DISK_BUILD=${CHECKREQS_DISK_USR}
-
-unpack_data() {
- local x version=${1}
- shift
-
- for x; do
- local cat=${x%/*}
- local pkg=${x#*/}
-
- mkdir -p "${S}/${cat}" || die
- cd "${S}/${cat}" || die
- unpack "nltk-${pkg}-${version}.zip"
- done
-}
-
-src_unpack() {
- unpack_data 20200312 "${PACKAGES_UNPACK_2020[@]}"
- unpack_data 20211023 "${PACKAGES_UNPACK_2021[@]}"
- use extra && unpack_data 20200312 "${PACKAGES_UNPACK_EXTRA_2020[@]}"
-}
-
-install_zips() {
- local x version=${1}
- shift
-
- for x; do
- local cat=${x%/*}
- local pkg=${x#*/}
-
- insinto "/usr/share/nltk_data/${cat}"
- newins "${DISTDIR}/nltk-${pkg}-${version}.zip" "${pkg}.zip"
- done
-}
-
-src_install() {
- dodir /usr/share/nltk_data
- mv * "${ED}/usr/share/nltk_data/" || die
-
- install_zips 20200312 "${PACKAGES_ZIP_2020[@]}"
-}
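
Note on the Manifest hunks above: each DIST entry lists the distfile name, its size in bytes, and then pairs of hash name and hash value. The following is a minimal sketch of reading the recorded size for one distfile, assuming Manifest-style text on standard input. The helper manifest_size is hypothetical and is not a Portage tool, and the hash values in the sample are replaced by the placeholder <hash> rather than copied in full.

```shell
# Hypothetical helper (not a Portage tool): print the size recorded for
# the named distfile in Manifest-style text read from standard input.
manifest_size() {
	local want="${1}" kind name size rest
	while read -r kind name size rest; do
		if [ "${kind}" = "DIST" ] && [ "${name}" = "${want}" ]; then
			printf '%s\n' "${size}"
			return 0
		fi
	done
	# no matching DIST entry found
	return 1
}

# Names and sizes taken from the Manifest diff; <hash> is a placeholder.
sample='DIST nltk-inaugural-20211221.zip 346476 BLAKE2B <hash> SHA512 <hash>
DIST nltk-wordnet31-20211221.zip 11058667 BLAKE2B <hash> SHA512 <hash>'

printf '%s\n' "${sample}" | manifest_size nltk-wordnet31-20211221.zip
```

A size mismatch between this value and the downloaded file is the first sign that upstream has replaced an archive in place, which is the situation the date-stamped distfile names are meant to handle.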
* [gentoo-commits] repo/gentoo:master commit in: dev-python/nltk-data/
@ 2022-04-24 11:06 Jakov Smolić
From: Jakov Smolić @ 2022-04-24 11:06 UTC
To: gentoo-commits
commit: 6ce93f7d306dc4391ec88bfc1956eb38e469c8c1
Author: Yu Gu <guyu2876 <AT> gmail <DOT> com>
AuthorDate: Thu Apr 21 11:46:38 2022 +0000
Commit: Jakov Smolić <jsmolic <AT> gentoo <DOT> org>
CommitDate: Sun Apr 24 11:05:31 2022 +0000
URL: https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=6ce93f7d
dev-python/nltk-data: Keyword 20211221 riscv, #839027
Signed-off-by: Yu Gu <guyu2876 <AT> gmail.com>
Signed-off-by: Jakov Smolić <jsmolic <AT> gentoo.org>
dev-python/nltk-data/nltk-data-20211221.ebuild | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)
diff --git a/dev-python/nltk-data/nltk-data-20211221.ebuild b/dev-python/nltk-data/nltk-data-20211221.ebuild
index 104917b4e5a0..4eb1cac63622 100644
--- a/dev-python/nltk-data/nltk-data-20211221.ebuild
+++ b/dev-python/nltk-data/nltk-data-20211221.ebuild
@@ -1,4 +1,4 @@
-# Copyright 2020-2021 Gentoo Authors
+# Copyright 2020-2022 Gentoo Authors
# Distributed under the terms of the GNU General Public License v2
EAPI=8
@@ -12,7 +12,7 @@ HOMEPAGE="https://www.nltk.org/nltk_data/"
# TODO: create a USE flag for free-ish subset
LICENSE="all-rights-reserved"
SLOT="0"
-KEYWORDS="amd64 x86"
+KEYWORDS="amd64 ~riscv x86"
IUSE="extra"
RESTRICT="bindist mirror"
* [gentoo-commits] repo/gentoo:master commit in: dev-python/nltk-data/
@ 2022-07-07 19:06 Michał Górny
From: Michał Górny @ 2022-07-07 19:06 UTC
To: gentoo-commits
commit: 1b704c5d34ba6eda30d9e2811b4bc97a84179071
Author: Michał Górny <mgorny <AT> gentoo <DOT> org>
AuthorDate: Thu Jul 7 15:44:13 2022 +0000
Commit: Michał Górny <mgorny <AT> gentoo <DOT> org>
CommitDate: Thu Jul 7 19:06:39 2022 +0000
URL: https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=1b704c5d
dev-python/nltk-data: Update to 20220704
Signed-off-by: Michał Górny <mgorny <AT> gentoo.org>
dev-python/nltk-data/Manifest | 3 +
dev-python/nltk-data/nltk-data-20220704.ebuild | 223 +++++++++++++++++++++++++
2 files changed, 226 insertions(+)
diff --git a/dev-python/nltk-data/Manifest b/dev-python/nltk-data/Manifest
index 3a61ebbf461e..4dc34058effa 100644
--- a/dev-python/nltk-data/Manifest
+++ b/dev-python/nltk-data/Manifest
@@ -22,6 +22,7 @@ DIST nltk-crubadan-20200312.zip 5288655 BLAKE2B 2f5103c9801b965ce5d893b690dedc5f
DIST nltk-dependency_treebank-20200312.zip 457429 BLAKE2B a60a95f9bcbb72b4ee19a91c380eb620261687f23bd867ae4cf2d7937b60b8dacafb52c09d7bcbcb9fbbbda4e57954ee55a900d70e69e7e290e53c668dd0775f SHA512 b225ace2e2413f0b9ae0e142d81052e64a1bcacb71caf7b07a577c18dec45dc429116abfd12edd3bb1442a111f4fa6e3e33e2b5f113e26dfc4e917268d821736
DIST nltk-dolch-20200312.zip 2116 BLAKE2B a79b5d5de2ab03be43e5fdb1b34c70a458af6efce0e35f42f3c4d748c84cd950c98981f969fea2b0507ee405438f84b50e526aa94a93f99de7567ccb5c462215 SHA512 29254c95d11fd2001f486a5c682f7ab7f23fa13472ac4142db3702e2fa994f74ffabc2f571f1feb94a2e65fcb8da97c57a519f24504379b0e369fd27f9f867f0
DIST nltk-europarl_raw-20200312.zip 12594977 BLAKE2B 295e274880ae34c9618a70cabc1b1075e35c442158aabe437972caebbf36d09c4f5f5b66d8ae7ac953f4312f9dac8f5e0c6d3346a444afe98ac700fb3e622539 SHA512 47dcd5420df6ab294f75204355cf346257a821d491f65b856c41a77237df1d7aeb352ed1353da55bfb68fa9f0d8782b9aac9c33bd49af3b395cc780dc421e26d
+DIST nltk-extended_omw-20220704.zip 11251284 BLAKE2B 11dafcc49c3608e5178b2cf82a943a95afd900f80308bf40d56b29a4518029832fe52241df728f6c38114e60b9c03617337717d987221918a11810b77b1b33ea SHA512 b1f53bef1cc43a5bf9996cc037364f4668e1055385d43e3276a1c58e4345b6ee09d8b577ccdebe8e65bf3367b8da37c05c6dabd282e469808143805303bfeec5
DIST nltk-floresta-20200312.zip 1882021 BLAKE2B 092886cb15c6215b4fe979ce62a1a137de97c4749c372426211d36117d367697d19ebe6a9337f702ee9dfc133318b1e7de51393c5a41afc31a960e10ebe7c498 SHA512 1a71453b9629d622c3468b1e00501683e0415996a62a30c498486354b82f0b3be76080dddc60117c8c9f29de3b34a2166a5de87376a0daeaf8b92bfe5dac668a
DIST nltk-framenet_v15-20200312.zip 69337891 BLAKE2B 046087cb20013f3a597034458dd09b4cb7378b19f91f5ebf7196a92a02741a67b671f094262ac5d8e0e376743048c56dd1d65e6c37efb3b254bb3f8f6434addf SHA512 4b55a435f17afbb9cf5fb59691a03776784830cfa6f1f7b9c9c64770fa6dbb640bb3256a099328e44a0a6692ed52703b9f253e5229ef71ee300ace34027ba462
DIST nltk-framenet_v17-20200312.zip 99207152 BLAKE2B b1a32c535d37fddba5c77922f3a6097859d8bf59a4bcf37dca4ff394fcf027fc68d781b3b6c0a8d5cd9a423a944687994ec6a2023dd45808dc697b4c90f28286 SHA512 3696c5bb0fdd59058a8723ba3ca962a0d6422b36d8dcfad0427caa21dae9579b85f2f9af431aa1fc1cb76e823676a996c5a26aca99782017270d03ef3bd77f0d
@@ -83,6 +84,7 @@ DIST nltk-snowball_data-20200312.zip 6785405 BLAKE2B 44c10439b142540ac7eece967ef
DIST nltk-spanish_grammars-20200312.zip 4047 BLAKE2B d8a8dbb558850a6a60f1fe5ab0f617f3a0f3c64bc7d49980cf793d374c6679d1bd42afb7e61776737b5eec162f2520abf2ee3acc92ea9ee0f397c3089b3b5b28 SHA512 4513347156e9351c259c0e2448198d68354bbc95e0a54561c31a88f13f333ebcba3e294c820cb62036665f2904eb6a7137546cd580e361c0423c30a8aed950b2
DIST nltk-state_union-20200312.zip 808757 BLAKE2B 2d352af0ced736d3c11a821eaba0b035b3b5b6b0f20db3bd5d4ac2451f99daf68bffd3ad984bd404ecc4a1e67ca9281c529af2cc9e295a3a7330f36ee9640bc8 SHA512 a6fc83e6230e57ba66a7af62f0d2a5d44a14530ed1e0e914590b3f8b5bf939967c126a5e6f6899ba9134843893f65212e836d311109393c1200191a5c3163485
DIST nltk-stopwords-20211023.zip 26220 BLAKE2B 50219508c5fb24c1392064ea0546ca9060829f51689c0d626bb99e3fa8f712df98ed475fc0d27d99f934fb156ce65df91a8e7c22e1b4f16833339fb0aea34a9e SHA512 8308623953560281288b64e695638ca3fa28e1b6201b538d01650d6dfd08e821687217c8d012e93adfa2a48afebdda11af1bb86d638358c2931f36754d1e15ef
+DIST nltk-stopwords-20220704.zip 34276 BLAKE2B 529a26163c01123bb9b800cfc8f7f5a23701246194c0b55012b418384ca48c56eb5c27a5f104e81ba974cec845e7025829e9d396a384d331903982cd29c98623 SHA512 8b76207f53d05c251aac078d9b8537f100a202cc2a43892af0c79da8d99ae8335225a1bc8a56f1be4cd1ef45c9c987b6c9c334e0f9fa59746bb33694c58a556b
DIST nltk-subjectivity-20200312.zip 521628 BLAKE2B 0a8777a5b91b1b825fbde41cc927d496480129f0a810349bddde2036ed16f37611f2bc3b007e74fe36523612529a92433d32d094be72d247f5faef8220c3c491 SHA512 a3cc4d2d20f26c5eabd86fbced2c013e69d46e607013278eb35831a62e57523a17aec1b580ba62c7a867e61a561e1b222d8430f0c1e2d429a9479e12b008c5b7
DIST nltk-swadesh-20200312.zip 22828 BLAKE2B 1cd9d5355c6b53694ec545ca001b0807b4912a7878ba075b0f81ce8b9b22a5c7a18cf52cf2449483a1c89cd20d8d86986dc3d827fa93a7ef5824fddbc0922025 SHA512 90cb32532a5378d05ce34b84b5f8363dbb32f24afac58b0dcc5cdccba98fd7d37def7f4fbe76b11c8c64059bd19df745562bfbf5f4c721d65ce9f4be1348ab76
DIST nltk-switchboard-20200312.zip 791161 BLAKE2B 211116a751ae246fa31b6aca96b396d3642d89ad112588a09f8d91a5b76dc41c7fb4d36c16c6358cd8e0da8056bc83598ed0dc635cf7b1fd8469a0e80b5f1761 SHA512 690e5392dc082c4ac550bde2848aa65117e7a25cbc4bce0887581c531d03be64e21f044ac0a3286648255f0edd7766b1161f5575ad5fd680c7303b34c3226b8f
@@ -95,6 +97,7 @@ DIST nltk-udhr-20200312.zip 1170177 BLAKE2B cab5f8b4ffa9c0c303620a499f61bbe65454
DIST nltk-udhr2-20200312.zip 1653975 BLAKE2B 43ee8c9c490a6a8ee5f0d9273bab83f77ba3d977da7eaeded3239b8c16f3432ef36478d0d290b1df9fe9af4e0c5de1902b8fe699788b7681003e5c8c426574e3 SHA512 eda644aedba1d0b9d2605dc6b1c8c77943692b4404c4a9c904c82956432c9ba7e27bb0f496e63d3eef7f29b4b34eb929f8d9228b4fd56613d5d4f743fa1d87e0
DIST nltk-unicode_samples-20200312.zip 1212 BLAKE2B 715c6030baf56d766dce7b920500c29dc98ce4edc9b41fb4c858960a4630469fe5a269be6d0b2d33b54ee0208014b1e43f5d9ec49b878442963ea21e4e3acda6 SHA512 88cac1ef0f43fc491285c0638807f88f52d1baba8964cb2cf37c4ee3e8cd8c080071f108fb553fdc14ed17cb0e80343097c08ac7c1613b0abbd84f559e38fba8
DIST nltk-universal_tagset-20200312.zip 19095 BLAKE2B 179297dac3a6a30a4a55e8affb480cc95a7a1dd1ee0ac6a0d4381c655c5dbf065e16f750ebd28846299f114d3355a9edf6cbed72242ebf5fd257f339fcd8f40d SHA512 ca8915e8c3bcb8515ffe0da21fb7066d213683ff702bf247d9fa9f7c966c66d06facafef1d600b7223d7aa1e4dfba1a2a75e1bb05b2c5d78beb312b8551d3fed
+DIST nltk-universal_tagset-20220704.zip 19095 BLAKE2B 829bb169ec42005fd5c0bc827d0820eab49acce562240b942dcccc4d3bd96b3f17d24fe46f21d528264ebe976df79e2bb00da33a889759cc20cbb06f40a6ac41 SHA512 2a0094744c0e697e0cbc2c49002f41d52973a87dc940636b5edb1a162f7ac4f165911555be590c5b66d69cede918ed9cf2d3ccd59f91286eba274ef44c8df28c
DIST nltk-universal_treebanks_v20-20200312.zip 25908853 BLAKE2B 20f8a6c0e759ab15560e71f48527e26a0dac554b8996dcebcacd2c5d5e3cea8947481734a03680843176025fde6d5e484e9414dd239043dca380c79053d71d8f SHA512 3dcabe02ff8bb1044f7d9e1c5af294af05291ff33a6c8dc16d3a3c855ccbf3a8261d92ea926e15e0d7f9228ae81e7863a283b156812ad30575c9ea947ecbcc55
DIST nltk-vader_lexicon-20200312.zip 90486 BLAKE2B aaa4a07a600d6ef53373250afd62a1a5e76cbeb28b56c06879d07a6cea593d7041300e6eaf2ce39b87dc0c04a536fae111df80604e7f517878e680fb9db16019 SHA512 55d41757736806a2a0dbcaedef28000b4461eb44dce308c9e06315feb55f843914f743c37b6cf220fc9e61c04f8f7f41dd338438059e2e803fc2b4e91f739923
DIST nltk-verbnet-20200312.zip 323661 BLAKE2B af5b26a63853d1bc3fc99c9cba3c50a82bc148cb6d95a26045c9aa29ba83408db76b098354e99f07696d5e2373ed9d8a35ed8504ac9b20e1b4d16252a238bc85 SHA512 00368fb2d5cf1bf818f1d72762d2c40a98067937d71a65ece2ca0f4bfab6f8957b32ba66741901af34258cf3309d00dae433b1092d0b7adb145a791dc3bf859b
diff --git a/dev-python/nltk-data/nltk-data-20220704.ebuild b/dev-python/nltk-data/nltk-data-20220704.ebuild
new file mode 100644
index 000000000000..fdbb6fd0da4e
--- /dev/null
+++ b/dev-python/nltk-data/nltk-data-20220704.ebuild
@@ -0,0 +1,223 @@
+# Copyright 2020-2022 Gentoo Authors
+# Distributed under the terms of the GNU General Public License v2
+
+EAPI=8
+
+inherit check-reqs
+
+DESCRIPTION="Data files for NLTK"
+HOMEPAGE="https://www.nltk.org/nltk_data/"
+
+# at least some of the files have poorly documented licenses
+# TODO: create a USE flag for free-ish subset
+LICENSE="all-rights-reserved"
+SLOT="0"
+KEYWORDS="amd64 ~riscv x86"
+IUSE="extra"
+RESTRICT="bindist mirror"
+
+BDEPEND="app-arch/unzip"
+
+# https://github.com/nltk/nltk_data/commits/gh-pages
+
+PACKAGES_ZIP_2020=(
+ # wget -O - https://www.nltk.org/nltk_data/ | xml sel -t -m '//package[@unzip=0]' -v @subdir -o "/" -v @id -n - | sort
+ corpora/comtrans
+ corpora/conll2007
+ corpora/jeita
+ corpora/knbc
+ corpora/machado
+ corpora/masc_tagged
+ corpora/nombank.1.0
+ corpora/panlex_swadesh
+ corpora/propbank
+ corpora/reuters
+ corpora/semcor
+ corpora/universal_treebanks_v20
+ sentiment/vader_lexicon
+ stemmers/snowball_data
+)
+
+PACKAGES_UNPACK_2020=(
+ # wget -O - https://www.nltk.org/nltk_data/ | xml sel -t -m '//package[@unzip=1]' -v @subdir -o "/" -v @id -n - | sort
+ corpora/abc
+ corpora/alpino
+ corpora/brown
+ corpora/cess_cat
+ corpora/cess_esp
+ corpora/chat80
+ corpora/city_database
+ corpora/cmudict
+ corpora/comparative_sentences
+ corpora/conll2000
+ corpora/conll2002
+ corpora/crubadan
+ corpora/dependency_treebank
+ corpora/dolch
+ corpora/europarl_raw
+ corpora/floresta
+ corpora/framenet_v15
+ corpora/framenet_v17
+ corpora/gazetteers
+ corpora/genesis
+ corpora/gutenberg
+ corpora/ieer
+ corpora/indian
+ corpora/lin_thesaurus
+ corpora/mac_morpho
+ corpora/movie_reviews
+ corpora/mte_teip5
+ corpora/names
+ corpora/nonbreaking_prefixes
+ corpora/nps_chat
+ corpora/omw
+ corpora/opinion_lexicon
+ corpora/pl196x
+ corpora/ppattach
+ corpora/product_reviews_1
+ corpora/product_reviews_2
+ corpora/pros_cons
+ corpora/ptb
+ corpora/qc
+ corpora/rte
+ corpora/senseval
+ corpora/sentence_polarity
+ corpora/sentiwordnet
+ corpora/shakespeare
+ corpora/state_union
+ corpora/subjectivity
+ corpora/swadesh
+ corpora/switchboard
+ corpora/timit
+ corpora/toolbox
+ corpora/treebank
+ corpora/twitter_samples
+ corpora/udhr
+ corpora/udhr2
+ corpora/verbnet
+ corpora/webtext
+ corpora/wordnet
+ corpora/wordnet_ic
+ corpora/words
+ grammars/book_grammars
+ grammars/large_grammars
+ grammars/sample_grammars
+ misc/perluniprops
+ models/bllip_wsj_no_aux
+ models/moses_sample
+ models/wmt15_eval
+ models/word2vec_sample
+ stemmers/porter_test
+ stemmers/rslp
+ taggers/averaged_perceptron_tagger
+ taggers/averaged_perceptron_tagger_ru
+ tokenizers/punkt
+)
+
+PACKAGES_UNPACK_2021_12=(
+ corpora/inaugural
+ corpora/omw-1.4
+ corpora/wordnet2021
+ corpora/wordnet31
+ corpora/sinica_treebank
+)
+
+PACKAGES_UNPACK_2022=(
+ corpora/stopwords
+ taggers/universal_tagset
+)
+
+PACKAGES_UNPACK_EXTRA_2020=(
+ chunkers/maxent_ne_chunker
+ corpora/biocreative_ppi
+ corpora/brown_tei
+ corpora/kimmo
+ corpora/paradigms
+ corpora/pe08
+ corpora/pil
+ corpora/problem_reports
+ corpora/smultron
+ corpora/unicode_samples
+ corpora/verbnet3
+ corpora/ycoe
+ grammars/basque_grammars
+ grammars/spanish_grammars
+ help/tagsets
+ misc/mwa_ppdb
+ taggers/maxent_treebank_pos_tagger
+)
+
+PACKAGES_ZIP_EXTRA_2022=(
+ corpora/extended_omw
+)
+
+add_data() {
+ local x version=${1}
+ shift
+
+ for x; do
+ SRC_URI+="
+ https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages/${x}.zip
+ -> nltk-${x#*/}-${version}.zip"
+ done
+}
+
+add_data 20200312 "${PACKAGES_ZIP_2020[@]}" "${PACKAGES_UNPACK_2020[@]}"
+add_data 20211221 "${PACKAGES_UNPACK_2021_12[@]}"
+add_data 20220704 "${PACKAGES_UNPACK_2022[@]}"
+SRC_URI+="
+ extra? ("
+add_data 20200312 "${PACKAGES_UNPACK_EXTRA_2020[@]}"
+add_data 20220704 "${PACKAGES_ZIP_EXTRA_2022[@]}"
+SRC_URI+="
+ )"
+
+CHECKREQS_DISK_USR=3G
+CHECKREQS_DISK_BUILD=${CHECKREQS_DISK_USR}
+
+unpack_data() {
+ local x version=${1}
+ shift
+
+ for x; do
+ local cat=${x%/*}
+ local pkg=${x#*/}
+
+ mkdir -p "${S}/${cat}" || die
+ cd "${S}/${cat}" || die
+ unpack "nltk-${pkg}-${version}.zip"
+ done
+}
+
+src_unpack() {
+ unpack_data 20200312 "${PACKAGES_UNPACK_2020[@]}"
+ unpack_data 20211023 "${PACKAGES_UNPACK_2021[@]}"
+ unpack_data 20211221 "${PACKAGES_UNPACK_2021_12[@]}"
+ unpack_data 20220704 "${PACKAGES_UNPACK_2022[@]}"
+ if use extra; then
+ unpack_data 20200312 "${PACKAGES_UNPACK_EXTRA_2020[@]}"
+ fi
+}
+
+install_zips() {
+ local x version=${1}
+ shift
+
+ for x; do
+ local cat=${x%/*}
+ local pkg=${x#*/}
+
+ insinto "/usr/share/nltk_data/${cat}"
+ newins "${DISTDIR}/nltk-${pkg}-${version}.zip" "${pkg}.zip"
+ done
+}
+
+src_install() {
+ dodir /usr/share/nltk_data
+ mv * "${ED}/usr/share/nltk_data/" || die
+
+ install_zips 20200312 "${PACKAGES_ZIP_2020[@]}"
+ if use extra; then
+ install_zips 20220704 "${PACKAGES_ZIP_EXTRA_2022[@]}"
+ fi
+}
^ permalink raw reply related [flat|nested] 14+ messages in thread
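Note on add_data in the ebuild above: every upstream URL is unversioned, and the helper renames each download to a date-stamped distfile name. Presumably this is what lets the Manifest carry separate checksums when upstream replaces a file in place, as with the 20211023 and 20220704 stopwords entries. The following is a minimal standalone sketch of that renaming, assuming plain bash outside Portage. SRC_URI is an ordinary variable here, and rename_for is a hypothetical helper that is not part of the ebuild.

```bash
#!/usr/bin/env bash
# Standalone sketch of the add_data helper; no Portage is involved.
SRC_URI=""

add_data() {
	local x version=${1}
	shift

	# "for x; do" iterates over the remaining positional parameters.
	for x; do
		# ${x#*/} strips the shortest prefix ending in "/",
		# leaving only the package name (e.g. "stopwords").
		SRC_URI+="
	https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages/${x}.zip
		-> nltk-${x#*/}-${version}.zip"
	done
}

# Hypothetical helper (not in the ebuild): print only the versioned
# distfile name that a given category/package entry is saved under.
rename_for() {
	local version=${1} x=${2}
	printf 'nltk-%s-%s.zip\n' "${x#*/}" "${version}"
}

add_data 20220704 corpora/stopwords taggers/universal_tagset
printf '%s\n' "${SRC_URI}"
rename_for 20220704 corpora/stopwords
```

The category part (everything before the slash, obtained with ${x%/*} in unpack_data and install_zips) is used only for the target directory, so two packages with the same name in different categories would collide on the distfile name.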
* [gentoo-commits] repo/gentoo:master commit in: dev-python/nltk-data/
@ 2022-07-07 19:06 Michał Górny
0 siblings, 0 replies; 14+ messages in thread
From: Michał Górny @ 2022-07-07 19:06 UTC (permalink / raw
To: gentoo-commits
commit: f926c209b2283f63cc9e96edc88e85d902b08b5b
Author: Michał Górny <mgorny <AT> gentoo <DOT> org>
AuthorDate: Thu Jul 7 15:44:18 2022 +0000
Commit: Michał Górny <mgorny <AT> gentoo <DOT> org>
CommitDate: Thu Jul 7 19:06:40 2022 +0000
URL: https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=f926c209
dev-python/nltk-data: Remove non-fetchable version
Closes: https://bugs.gentoo.org/856838
Signed-off-by: Michał Górny <mgorny <AT> gentoo.org>
dev-python/nltk-data/Manifest | 2 -
dev-python/nltk-data/nltk-data-20211221.ebuild | 210 -------------------------
2 files changed, 212 deletions(-)
diff --git a/dev-python/nltk-data/Manifest b/dev-python/nltk-data/Manifest
index 4dc34058effa..4b5a3ffbec68 100644
--- a/dev-python/nltk-data/Manifest
+++ b/dev-python/nltk-data/Manifest
@@ -83,7 +83,6 @@ DIST nltk-smultron-20200312.zip 166207 BLAKE2B d0c3e75dd108965e260d913e0c02137da
DIST nltk-snowball_data-20200312.zip 6785405 BLAKE2B 44c10439b142540ac7eece967efa1431fd8f45342f0a90875dacf29ad374fe4c7d30af11d42ba45e0f1ec1836d56b2ff684ee352c5e8536cfb5db5eb7632285b SHA512 6c8a9259d88f6f7f499867d83b731de99d7fa4e8827ecedf836f653fc1a810efa9f6c5c6e2720a9e6610bc00978956b6a119bd08b70e3e241c4e9faccddd81d8
DIST nltk-spanish_grammars-20200312.zip 4047 BLAKE2B d8a8dbb558850a6a60f1fe5ab0f617f3a0f3c64bc7d49980cf793d374c6679d1bd42afb7e61776737b5eec162f2520abf2ee3acc92ea9ee0f397c3089b3b5b28 SHA512 4513347156e9351c259c0e2448198d68354bbc95e0a54561c31a88f13f333ebcba3e294c820cb62036665f2904eb6a7137546cd580e361c0423c30a8aed950b2
DIST nltk-state_union-20200312.zip 808757 BLAKE2B 2d352af0ced736d3c11a821eaba0b035b3b5b6b0f20db3bd5d4ac2451f99daf68bffd3ad984bd404ecc4a1e67ca9281c529af2cc9e295a3a7330f36ee9640bc8 SHA512 a6fc83e6230e57ba66a7af62f0d2a5d44a14530ed1e0e914590b3f8b5bf939967c126a5e6f6899ba9134843893f65212e836d311109393c1200191a5c3163485
-DIST nltk-stopwords-20211023.zip 26220 BLAKE2B 50219508c5fb24c1392064ea0546ca9060829f51689c0d626bb99e3fa8f712df98ed475fc0d27d99f934fb156ce65df91a8e7c22e1b4f16833339fb0aea34a9e SHA512 8308623953560281288b64e695638ca3fa28e1b6201b538d01650d6dfd08e821687217c8d012e93adfa2a48afebdda11af1bb86d638358c2931f36754d1e15ef
DIST nltk-stopwords-20220704.zip 34276 BLAKE2B 529a26163c01123bb9b800cfc8f7f5a23701246194c0b55012b418384ca48c56eb5c27a5f104e81ba974cec845e7025829e9d396a384d331903982cd29c98623 SHA512 8b76207f53d05c251aac078d9b8537f100a202cc2a43892af0c79da8d99ae8335225a1bc8a56f1be4cd1ef45c9c987b6c9c334e0f9fa59746bb33694c58a556b
DIST nltk-subjectivity-20200312.zip 521628 BLAKE2B 0a8777a5b91b1b825fbde41cc927d496480129f0a810349bddde2036ed16f37611f2bc3b007e74fe36523612529a92433d32d094be72d247f5faef8220c3c491 SHA512 a3cc4d2d20f26c5eabd86fbced2c013e69d46e607013278eb35831a62e57523a17aec1b580ba62c7a867e61a561e1b222d8430f0c1e2d429a9479e12b008c5b7
DIST nltk-swadesh-20200312.zip 22828 BLAKE2B 1cd9d5355c6b53694ec545ca001b0807b4912a7878ba075b0f81ce8b9b22a5c7a18cf52cf2449483a1c89cd20d8d86986dc3d827fa93a7ef5824fddbc0922025 SHA512 90cb32532a5378d05ce34b84b5f8363dbb32f24afac58b0dcc5cdccba98fd7d37def7f4fbe76b11c8c64059bd19df745562bfbf5f4c721d65ce9f4be1348ab76
@@ -96,7 +95,6 @@ DIST nltk-twitter_samples-20200312.zip 16007673 BLAKE2B dc100181ff129c2e9774c643
DIST nltk-udhr-20200312.zip 1170177 BLAKE2B cab5f8b4ffa9c0c303620a499f61bbe654545a5689aaa710df743054836a7843e826d97ee91f7b84ba2dcd32ffce432f1932c53797e791de2776eaf183870988 SHA512 730f83cb663974edc099427fad0a3c1cd16769f5ef82446a78aa0e12f01b68a4c52037bdb5e3717dd6b08c42655b03d9d0f2f5869c01b117ae2ab8a428df0dcc
DIST nltk-udhr2-20200312.zip 1653975 BLAKE2B 43ee8c9c490a6a8ee5f0d9273bab83f77ba3d977da7eaeded3239b8c16f3432ef36478d0d290b1df9fe9af4e0c5de1902b8fe699788b7681003e5c8c426574e3 SHA512 eda644aedba1d0b9d2605dc6b1c8c77943692b4404c4a9c904c82956432c9ba7e27bb0f496e63d3eef7f29b4b34eb929f8d9228b4fd56613d5d4f743fa1d87e0
DIST nltk-unicode_samples-20200312.zip 1212 BLAKE2B 715c6030baf56d766dce7b920500c29dc98ce4edc9b41fb4c858960a4630469fe5a269be6d0b2d33b54ee0208014b1e43f5d9ec49b878442963ea21e4e3acda6 SHA512 88cac1ef0f43fc491285c0638807f88f52d1baba8964cb2cf37c4ee3e8cd8c080071f108fb553fdc14ed17cb0e80343097c08ac7c1613b0abbd84f559e38fba8
-DIST nltk-universal_tagset-20200312.zip 19095 BLAKE2B 179297dac3a6a30a4a55e8affb480cc95a7a1dd1ee0ac6a0d4381c655c5dbf065e16f750ebd28846299f114d3355a9edf6cbed72242ebf5fd257f339fcd8f40d SHA512 ca8915e8c3bcb8515ffe0da21fb7066d213683ff702bf247d9fa9f7c966c66d06facafef1d600b7223d7aa1e4dfba1a2a75e1bb05b2c5d78beb312b8551d3fed
DIST nltk-universal_tagset-20220704.zip 19095 BLAKE2B 829bb169ec42005fd5c0bc827d0820eab49acce562240b942dcccc4d3bd96b3f17d24fe46f21d528264ebe976df79e2bb00da33a889759cc20cbb06f40a6ac41 SHA512 2a0094744c0e697e0cbc2c49002f41d52973a87dc940636b5edb1a162f7ac4f165911555be590c5b66d69cede918ed9cf2d3ccd59f91286eba274ef44c8df28c
DIST nltk-universal_treebanks_v20-20200312.zip 25908853 BLAKE2B 20f8a6c0e759ab15560e71f48527e26a0dac554b8996dcebcacd2c5d5e3cea8947481734a03680843176025fde6d5e484e9414dd239043dca380c79053d71d8f SHA512 3dcabe02ff8bb1044f7d9e1c5af294af05291ff33a6c8dc16d3a3c855ccbf3a8261d92ea926e15e0d7f9228ae81e7863a283b156812ad30575c9ea947ecbcc55
DIST nltk-vader_lexicon-20200312.zip 90486 BLAKE2B aaa4a07a600d6ef53373250afd62a1a5e76cbeb28b56c06879d07a6cea593d7041300e6eaf2ce39b87dc0c04a536fae111df80604e7f517878e680fb9db16019 SHA512 55d41757736806a2a0dbcaedef28000b4461eb44dce308c9e06315feb55f843914f743c37b6cf220fc9e61c04f8f7f41dd338438059e2e803fc2b4e91f739923
diff --git a/dev-python/nltk-data/nltk-data-20211221.ebuild b/dev-python/nltk-data/nltk-data-20211221.ebuild
deleted file mode 100644
index 4eb1cac63622..000000000000
--- a/dev-python/nltk-data/nltk-data-20211221.ebuild
+++ /dev/null
@@ -1,210 +0,0 @@
-# Copyright 2020-2022 Gentoo Authors
-# Distributed under the terms of the GNU General Public License v2
-
-EAPI=8
-
-inherit check-reqs
-
-DESCRIPTION="Data files for NLTK"
-HOMEPAGE="https://www.nltk.org/nltk_data/"
-
-# at least some of the files have poorly documented licenses
-# TODO: create a USE flag for free-ish subset
-LICENSE="all-rights-reserved"
-SLOT="0"
-KEYWORDS="amd64 ~riscv x86"
-IUSE="extra"
-RESTRICT="bindist mirror"
-
-BDEPEND="app-arch/unzip"
-
-PACKAGES_ZIP_2020=(
- # wget -O - https://www.nltk.org/nltk_data/ | xml sel -t -m '//package[@unzip=0]' -v @subdir -o "/" -v @id -n - | sort
- corpora/comtrans
- corpora/conll2007
- corpora/jeita
- corpora/knbc
- corpora/machado
- corpora/masc_tagged
- corpora/nombank.1.0
- corpora/panlex_swadesh
- corpora/propbank
- corpora/reuters
- corpora/semcor
- corpora/universal_treebanks_v20
- sentiment/vader_lexicon
- stemmers/snowball_data
-)
-
-PACKAGES_UNPACK_2020=(
- # wget -O - https://www.nltk.org/nltk_data/ | xml sel -t -m '//package[@unzip=1]' -v @subdir -o "/" -v @id -n - | sort
- corpora/abc
- corpora/alpino
- corpora/brown
- corpora/cess_cat
- corpora/cess_esp
- corpora/chat80
- corpora/city_database
- corpora/cmudict
- corpora/comparative_sentences
- corpora/conll2000
- corpora/conll2002
- corpora/crubadan
- corpora/dependency_treebank
- corpora/dolch
- corpora/europarl_raw
- corpora/floresta
- corpora/framenet_v15
- corpora/framenet_v17
- corpora/gazetteers
- corpora/genesis
- corpora/gutenberg
- corpora/ieer
- corpora/indian
- corpora/lin_thesaurus
- corpora/mac_morpho
- corpora/movie_reviews
- corpora/mte_teip5
- corpora/names
- corpora/nonbreaking_prefixes
- corpora/nps_chat
- corpora/omw
- corpora/opinion_lexicon
- corpora/pl196x
- corpora/ppattach
- corpora/product_reviews_1
- corpora/product_reviews_2
- corpora/pros_cons
- corpora/ptb
- corpora/qc
- corpora/rte
- corpora/senseval
- corpora/sentence_polarity
- corpora/sentiwordnet
- corpora/shakespeare
- corpora/state_union
- corpora/subjectivity
- corpora/swadesh
- corpora/switchboard
- corpora/timit
- corpora/toolbox
- corpora/treebank
- corpora/twitter_samples
- corpora/udhr
- corpora/udhr2
- corpora/verbnet
- corpora/webtext
- corpora/wordnet
- corpora/wordnet_ic
- corpora/words
- grammars/book_grammars
- grammars/large_grammars
- grammars/sample_grammars
- misc/perluniprops
- models/bllip_wsj_no_aux
- models/moses_sample
- models/wmt15_eval
- models/word2vec_sample
- stemmers/porter_test
- stemmers/rslp
- taggers/averaged_perceptron_tagger
- taggers/averaged_perceptron_tagger_ru
- taggers/universal_tagset
- tokenizers/punkt
-)
-
-PACKAGES_UNPACK_2021=(
- corpora/stopwords
-)
-
-PACKAGES_UNPACK_2021_12=(
- corpora/inaugural
- corpora/omw-1.4
- corpora/sinica_treebank
- corpora/wordnet2021
- corpora/wordnet31
-)
-
-PACKAGES_UNPACK_EXTRA_2020=(
- chunkers/maxent_ne_chunker
- corpora/biocreative_ppi
- corpora/brown_tei
- corpora/kimmo
- corpora/paradigms
- corpora/pe08
- corpora/pil
- corpora/problem_reports
- corpora/smultron
- corpora/unicode_samples
- corpora/verbnet3
- corpora/ycoe
- grammars/basque_grammars
- grammars/spanish_grammars
- help/tagsets
- misc/mwa_ppdb
- taggers/maxent_treebank_pos_tagger
-)
-
-add_data() {
- local x version=${1}
- shift
-
- for x; do
- SRC_URI+="
- https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages/${x}.zip
- -> nltk-${x#*/}-${version}.zip"
- done
-}
-
-add_data 20200312 "${PACKAGES_ZIP_2020[@]}" "${PACKAGES_UNPACK_2020[@]}"
-add_data 20211023 "${PACKAGES_UNPACK_2021[@]}"
-add_data 20211221 "${PACKAGES_UNPACK_2021_12[@]}"
-SRC_URI+="
- extra? ("
-add_data 20200312 "${PACKAGES_UNPACK_EXTRA_2020[@]}"
-SRC_URI+="
- )"
-
-CHECKREQS_DISK_USR=3G
-CHECKREQS_DISK_BUILD=${CHECKREQS_DISK_USR}
-
-unpack_data() {
- local x version=${1}
- shift
-
- for x; do
- local cat=${x%/*}
- local pkg=${x#*/}
-
- mkdir -p "${S}/${cat}" || die
- cd "${S}/${cat}" || die
- unpack "nltk-${pkg}-${version}.zip"
- done
-}
-
-src_unpack() {
- unpack_data 20200312 "${PACKAGES_UNPACK_2020[@]}"
- unpack_data 20211023 "${PACKAGES_UNPACK_2021[@]}"
- unpack_data 20211221 "${PACKAGES_UNPACK_2021_12[@]}"
- use extra && unpack_data 20200312 "${PACKAGES_UNPACK_EXTRA_2020[@]}"
-}
-
-install_zips() {
- local x version=${1}
- shift
-
- for x; do
- local cat=${x%/*}
- local pkg=${x#*/}
-
- insinto "/usr/share/nltk_data/${cat}"
- newins "${DISTDIR}/nltk-${pkg}-${version}.zip" "${pkg}.zip"
- done
-}
-
-src_install() {
- dodir /usr/share/nltk_data
- mv * "${ED}/usr/share/nltk_data/" || die
-
- install_zips 20200312 "${PACKAGES_ZIP_2020[@]}"
-}
^ permalink raw reply related [flat|nested] 14+ messages in thread
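Note on the Manifest entries removed above: each DIST line records a file name, a size in bytes, and BLAKE2B and SHA512 digests. The two universal_tagset entries in this thread have the same size (19095) but different digests, so a size comparison alone would not detect the changed upstream file. The sketch below illustrates a check against one such line, assuming GNU coreutils (sha512sum) is available. check_dist is a hypothetical helper, not a Portage command, and it verifies only the size and the SHA512 field.

```bash
#!/usr/bin/env bash
# Hypothetical helper: compare a file against one Manifest line of the form
#   DIST <name> <size> BLAKE2B <hash> SHA512 <hash>
# Returns 0 on match, 1 on size or digest mismatch, 2 on a malformed line.
check_dist() {
	local line=${1} file=${2}
	local kind name size b2tag b2 shatag sha
	read -r kind name size b2tag b2 shatag sha <<<"${line}"

	[[ ${kind} == DIST && ${shatag} == SHA512 ]] || return 2
	# Cheap check first: the recorded size in bytes.
	[[ $(wc -c <"${file}") -eq ${size} ]] || return 1
	# Then the SHA512 digest; the BLAKE2B field is ignored in this sketch.
	[[ $(sha512sum "${file}" | cut -d' ' -f1) == "${sha}" ]] || return 1
}

# Usage with a throwaway file instead of a real distfile.
tmp=$(mktemp)
printf 'hello\n' >"${tmp}"
line="DIST demo.zip 6 BLAKE2B unused SHA512 $(sha512sum "${tmp}" | cut -d' ' -f1)"
check_dist "${line}" "${tmp}" && echo "size and SHA512 match"
rm -f "${tmp}"
```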
* [gentoo-commits] repo/gentoo:master commit in: dev-python/nltk-data/
@ 2022-11-08 5:17 Michał Górny
0 siblings, 0 replies; 14+ messages in thread
From: Michał Górny @ 2022-11-08 5:17 UTC (permalink / raw
To: gentoo-commits
commit: e0e00d7718b35b3261959df268a6318e5911e652
Author: Michał Górny <mgorny <AT> gentoo <DOT> org>
AuthorDate: Tue Nov 8 05:16:27 2022 +0000
Commit: Michał Górny <mgorny <AT> gentoo <DOT> org>
CommitDate: Tue Nov 8 05:17:01 2022 +0000
URL: https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=e0e00d77
dev-python/nltk-data: Bump to 20221108 for changed files
Closes: https://bugs.gentoo.org/865693
Signed-off-by: Michał Górny <mgorny <AT> gentoo.org>
dev-python/nltk-data/Manifest | 2 +-
.../{nltk-data-20220704.ebuild => nltk-data-20221108.ebuild} | 7 ++++++-
2 files changed, 7 insertions(+), 2 deletions(-)
diff --git a/dev-python/nltk-data/Manifest b/dev-python/nltk-data/Manifest
index 4b5a3ffbec68..549228678b2a 100644
--- a/dev-python/nltk-data/Manifest
+++ b/dev-python/nltk-data/Manifest
@@ -67,7 +67,7 @@ DIST nltk-product_reviews_2-20200312.zip 170698 BLAKE2B 395d5ac71b759a546a3e25fb
DIST nltk-propbank-20200312.zip 5323498 BLAKE2B 7abde7caca8fa267bdcfdacc507ecef133d80989909563380f9ac296213ddd94680c25b0e9eb339d2ed7f851eda185966f6be63ed86fae5d9d397d8758ec75bf SHA512 6bb6f88648c1f6044d30dbc56ce1924240ddc1420c7abe8ebecf3684d69bc2dc20f1c575a6ae368a39fff40c5ef09f80d94bf317a37301639321085f102b54bf
DIST nltk-pros_cons-20200312.zip 746276 BLAKE2B 7a1aec593396f25c75949bde9627b0568e4d406fb2d42ccb6e7be6ff7d3b118bdfbf34b574e40cdb27d8e65e5a370f820c839bfbc0140b2b177c9dbdd89e3516 SHA512 9eea1070b67e648d5e6e84d5a336d95c9eb3192cd0fa4a66f298076ac5907783ecda5cfe1b6e66209a99ec4e326666b14e1cc89ffb4cc8b2b91fed7db77db75f
DIST nltk-ptb-20200312.zip 6289 BLAKE2B cc4c91569a5a9bad0f3cfce8e0d98179a53b3caddfb66ece085b293b882e46328a108c3be86261dfd869ea2fcc33d3687bc41adac3ada88ea748cd3480c88836 SHA512 dd59d84f20ec52c38063f0961954c063f614b94696307a63fae21839c51c81435d0205fa276f6811d9da7fd083e93f51f0603b5c35dcddc539a9bf59b9dffb6e
-DIST nltk-punkt-20200312.zip 13707633 BLAKE2B 22a0b3aec8a72735fb39aeeb596601452a70cd69e91206e94c5204153da8dedea3b7ec00c460fe19962e223adfd9cffc05a00daa4df48e3f2da500da51e99d2f SHA512 d477823c0cab57a374822cf58dd8bad35b86a504a51dda10de33b27cfd2ca4c89bba946c09f1ed3b28ab4d1cc1e5f3d3781c5f78ebe87458e1df9d48c8841782
+DIST nltk-punkt-20221108.zip 13905355 BLAKE2B 648d0997e193cde81f838a67f914355ff107973b8858ac081b11c1ee39d23f0416b39ccee5df63a37574bac6f0e1ac2f50edc4092eda0f4c27d635f2759910a3 SHA512 faaf7f37d18a0ced9fc4fb513d703141115cf5a769531f84c8bb9f227b5ff99cb01957be49bffbf3a7131182d66793282044a431a6bf80b19dd7ce63e542e48e
DIST nltk-qc-20200312.zip 125456 BLAKE2B ae02a2cd727e1832ede5fa5b3ef561059c9c714ced80bca311a360dacec6e176c034d802838335437cb8fa25ea877c3774fbac6790cdf6b9b75a3908e76e2dcd SHA512 8cca288f51164f7a18e1c99f2b6a35a3d890bcc5471b5be26883dd87d9b5e06214d15cc1133598fee330f0daf74ceaaa862410f6d95dfd56842a2cb84dd67390
DIST nltk-reuters-20200312.zip 6378691 BLAKE2B adcb783246f4fd6e73b32fa9419570d9c7e9ecff1c9e2ae39a587f9036ab5eaf9a6ed87b7794dfe2f0ba965cffca68e51c243e3396c179a0282555601caee55f SHA512 21b5a4e114137cd8a52b6e3567c2d72dbe7eac8a3234952896126d9d598fbfa72a7c1e6b7c424b8b016ed345d7350cc562c6d039e2fd6ec0d73c68e13e58605e
DIST nltk-rslp-20200312.zip 3805 BLAKE2B bf4500dd0e28cb4f018b8d13c5b920294163e7a9735dda12935f6b25ce5c71f2181e3bcb33444d490ba7bb1a029ef50dcdb2df4bddb09bfaf9d91b91531b8461 SHA512 83994b9cf2c925e559c68a03abb1499e3cd308fdb5ac6e698c3b5e2458393d9eae2420b02b4c46fd04527913133a1f19b3a2e5312f07e222e8cf072886f94973
diff --git a/dev-python/nltk-data/nltk-data-20220704.ebuild b/dev-python/nltk-data/nltk-data-20221108.ebuild
similarity index 97%
rename from dev-python/nltk-data/nltk-data-20220704.ebuild
rename to dev-python/nltk-data/nltk-data-20221108.ebuild
index fdbb6fd0da4e..26c9531c0a33 100644
--- a/dev-python/nltk-data/nltk-data-20220704.ebuild
+++ b/dev-python/nltk-data/nltk-data-20221108.ebuild
@@ -111,7 +111,6 @@ PACKAGES_UNPACK_2020=(
stemmers/rslp
taggers/averaged_perceptron_tagger
taggers/averaged_perceptron_tagger_ru
- tokenizers/punkt
)
PACKAGES_UNPACK_2021_12=(
@@ -127,6 +126,10 @@ PACKAGES_UNPACK_2022=(
taggers/universal_tagset
)
+PACKAGES_UNPACK_2022_11=(
+ tokenizers/punkt
+)
+
PACKAGES_UNPACK_EXTRA_2020=(
chunkers/maxent_ne_chunker
corpora/biocreative_ppi
@@ -165,6 +168,7 @@ add_data() {
add_data 20200312 "${PACKAGES_ZIP_2020[@]}" "${PACKAGES_UNPACK_2020[@]}"
add_data 20211221 "${PACKAGES_UNPACK_2021_12[@]}"
add_data 20220704 "${PACKAGES_UNPACK_2022[@]}"
+add_data 20221108 "${PACKAGES_UNPACK_2022_11[@]}"
SRC_URI+="
extra? ("
add_data 20200312 "${PACKAGES_UNPACK_EXTRA_2020[@]}"
@@ -194,6 +198,7 @@ src_unpack() {
unpack_data 20211023 "${PACKAGES_UNPACK_2021[@]}"
unpack_data 20211221 "${PACKAGES_UNPACK_2021_12[@]}"
unpack_data 20220704 "${PACKAGES_UNPACK_2022[@]}"
+ unpack_data 20221108 "${PACKAGES_UNPACK_2022_11[@]}"
if use extra; then
unpack_data 20200312 "${PACKAGES_UNPACK_EXTRA_2020[@]}"
fi
^ permalink raw reply related [flat|nested] 14+ messages in thread
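Note on the src_unpack hunk above: the context line calling unpack_data 20211023 with PACKAGES_UNPACK_2021 is still present, although, as far as the diffs in this thread show, the 20220704 and 20221108 ebuilds no longer define that array. In bash, a quoted "${array[@]}" expansion of an unset array yields zero words (when nounset is not enabled), so the loop body would never run and the call appears to be a harmless no-op. A minimal sketch of that behaviour follows. count_args is a hypothetical stand-in for unpack_data that counts entries instead of unpacking them.

```bash
#!/usr/bin/env bash
# Hypothetical stand-in for unpack_data: report how many package entries
# follow the version argument.
count_args() {
	local x version=${1} n=0
	shift

	# Same loop form as the ebuild: iterate over the remaining arguments.
	for x; do
		n=$((n + 1))
	done
	printf '%s %d\n' "${version}" "${n}"
}

# Mirror the situation in the ebuild: one array is undefined, one is not.
unset PACKAGES_UNPACK_2021
PACKAGES_UNPACK_2022_11=( tokenizers/punkt )

count_args 20211023 "${PACKAGES_UNPACK_2021[@]}"     # prints "20211023 0"
count_args 20221108 "${PACKAGES_UNPACK_2022_11[@]}"  # prints "20221108 1"
```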
* [gentoo-commits] repo/gentoo:master commit in: dev-python/nltk-data/
@ 2022-11-19 16:57 Arthur Zamarin
0 siblings, 0 replies; 14+ messages in thread
From: Arthur Zamarin @ 2022-11-19 16:57 UTC (permalink / raw
To: gentoo-commits
commit: fc23b2f56ded733b993a1a5c7a1c045c0d61b7b7
Author: Arthur Zamarin <arthurzam <AT> gentoo <DOT> org>
AuthorDate: Sat Nov 19 16:56:50 2022 +0000
Commit: Arthur Zamarin <arthurzam <AT> gentoo <DOT> org>
CommitDate: Sat Nov 19 16:56:50 2022 +0000
URL: https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=fc23b2f5
dev-python/nltk-data: Keyword 20221108 ppc64, #872680
Signed-off-by: Arthur Zamarin <arthurzam <AT> gentoo.org>
dev-python/nltk-data/nltk-data-20221108.ebuild | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/dev-python/nltk-data/nltk-data-20221108.ebuild b/dev-python/nltk-data/nltk-data-20221108.ebuild
index 26c9531c0a33..44624c3de360 100644
--- a/dev-python/nltk-data/nltk-data-20221108.ebuild
+++ b/dev-python/nltk-data/nltk-data-20221108.ebuild
@@ -12,7 +12,7 @@ HOMEPAGE="https://www.nltk.org/nltk_data/"
# TODO: create a USE flag for free-ish subset
LICENSE="all-rights-reserved"
SLOT="0"
-KEYWORDS="amd64 ~riscv x86"
+KEYWORDS="amd64 ~ppc64 ~riscv x86"
IUSE="extra"
RESTRICT="bindist mirror"
^ permalink raw reply related [flat|nested] 14+ messages in thread
* [gentoo-commits] repo/gentoo:master commit in: dev-python/nltk-data/
@ 2022-12-06 14:54 Sam James
0 siblings, 0 replies; 14+ messages in thread
From: Sam James @ 2022-12-06 14:54 UTC (permalink / raw
To: gentoo-commits
commit: 0064b2519dafda70c1529ebb2bd396cbeea5d244
Author: Sam James <sam <AT> gentoo <DOT> org>
AuthorDate: Tue Dec 6 13:08:22 2022 +0000
Commit: Sam James <sam <AT> gentoo <DOT> org>
CommitDate: Tue Dec 6 14:54:43 2022 +0000
URL: https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=0064b251
dev-python/nltk-data: link to upstream bug re LICENSE
Signed-off-by: Sam James <sam <AT> gentoo.org>
dev-python/nltk-data/nltk-data-20221108.ebuild | 1 +
1 file changed, 1 insertion(+)
diff --git a/dev-python/nltk-data/nltk-data-20221108.ebuild b/dev-python/nltk-data/nltk-data-20221108.ebuild
index 44624c3de360..94c3c1e478d7 100644
--- a/dev-python/nltk-data/nltk-data-20221108.ebuild
+++ b/dev-python/nltk-data/nltk-data-20221108.ebuild
@@ -9,6 +9,7 @@ DESCRIPTION="Data files for NLTK"
HOMEPAGE="https://www.nltk.org/nltk_data/"
# at least some of the files have poorly documented licenses
+# https://github.com/nltk/nltk_data/issues/102
# TODO: create a USE flag for free-ish subset
LICENSE="all-rights-reserved"
SLOT="0"
^ permalink raw reply related [flat|nested] 14+ messages in thread
end of thread, other threads:[~2022-12-06 14:54 UTC | newest]
Thread overview: 14+ messages
2020-04-13 10:32 [gentoo-commits] repo/gentoo:master commit in: dev-python/nltk-data/ Michał Górny
-- strict thread matches above, loose matches on Subject: below --
2022-12-06 14:54 Sam James
2022-11-19 16:57 Arthur Zamarin
2022-11-08 5:17 Michał Górny
2022-07-07 19:06 Michał Górny
2022-07-07 19:06 Michał Górny
2022-04-24 11:06 Jakov Smolić
2021-12-21 9:42 Michał Górny
2021-12-21 9:42 Michał Górny
2021-10-23 20:46 Michał Górny
2020-05-09 11:24 Mikle Kolyada
2020-05-09 11:24 Mikle Kolyada
2020-05-04 11:26 Michał Górny
2020-03-13 9:56 Michał Górny