-
Notifications
You must be signed in to change notification settings - Fork 1
chore(deps): update dependency spacy to v3.8.7 #15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
03f9a4c to
13d7b77
Compare
13d7b77 to
c1827b3
Compare
c1827b3 to
d12d8c6
Compare
d12d8c6 to
1b3f347
Compare
1b3f347 to
f85a417
Compare
|
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
6 similar comments
|
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
|
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
|
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
|
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
|
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
|
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
f85a417 to
1519962
Compare
|
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
5 similar comments
|
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
|
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
|
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
|
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
|
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
1519962 to
3e0073e
Compare
|
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
2 similar comments
|
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
|
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
1008480 to
8ef3ca8
Compare
8ef3ca8 to
c488900
Compare
c488900 to
6cce1f3
Compare
6cce1f3 to
fdf7de7
Compare
fdf7de7 to
9531c81
Compare
9531c81 to
0319160
Compare
0319160 to
c5ecd53
Compare
c5ecd53 to
2cf929f
Compare
2cf929f to
4b9939a
Compare
4b9939a to
d8c3a9c
Compare
d8c3a9c to
563ddec
Compare
563ddec to
1f7b671
Compare
1f7b671 to
6f2333f
Compare
6f2333f to
5d4fd18
Compare
This PR contains the following updates:
== 3.0.6->==3.8.7Release Notes
explosion/spaCy (spacy)
v3.8.7Compare Source
v3.8.6Compare Source
v3.8.5Compare Source
v3.8.4Compare Source
v3.8.3Compare Source
v3.8.2Compare Source
v3.8.1Compare Source
v3.8.0Compare Source
v3.7.6Compare Source
v3.7.5: : Download sanitization, Typer compatibility, and a bugfix for linking gold entitiesCompare Source
✨ New features and improvements
spacy download(#13313).typing-extensions<5.0.0for Python < 3.8 (#13516).🔴 Bug fixes
use_gold_entsbehaviour for EntityLinker.📖 Documentation and examples
MorphAnalysis(#13433).👥 Contributors
@danieldk, @honnibal, @ines, @JoeSchiff, @nokados, @Paillat-dev, @rmitsch, @schorfma, @strickvl, @svlandeg, @ynx0
v3.7.4: : New textcat layers and fo/nn language extensionsCompare Source
✨ New features and improvements
TextCatReduce.v1layer for text classification (#13181).TextCatParametricAttention.v1layer for text classification (#13201).buildmodule for creating model packages by default (#13109).benchmark speedcommand (#13247).🔴 Bug fixes
Language.pipe.Doc.Tokenizer.explainfor special cases with whitespace.SparseLinearlayer.📖 Documentation and examples
trf_dataexamples and the transformer pipeline design section.👥 Contributors
@adrianeboyd, @danieldk, @evornov, @honnibal, @ines, @lise-brinck, @ridge-kimani, @rmitsch, @shadeMe, @svlandeg
v3.7.3Compare Source
v3.7.2: : Fixes for APIs and requirementsCompare Source
✨ New features and improvements
__all__fields (#13063).🔴 Bug fixes
spacy.cli.projectAPI.Anycomparisons forTokenandSpan.📖 Documentation and examples
spacy-llmincluding Azure OpenAI, PaLM, and Mistral support.👥 Contributors
@adrianeboyd, @honnibal, @ines, @rmitsch, @svlandeg
v3.7.1: : Bug fix for spacy.cli module loadingCompare Source
🔴 Bug fixes
spacy.infoto fix availability ofspacy.clifollowingimport spacy(#13040).👥 Contributors
@adrianeboyd, @honnibal, @ines, @svlandeg
v3.7.0: : Trained pipelines using Curated Transformers and support for Python 3.12Compare Source
This release drops support for Python 3.6 and adds support for Python 3.12.
✨ New features and improvements
spacy projectcommands should run as before, just now they're using Weasel under the hood.transformersextra tospacy-transformersv1.3 (#13025).--spans-keyoption for CLI evaluation withspacy benchmark accuracy(#12981).spacy.info(#12962).spacy.training.example(#12801).Language.replace_listeners: Pass the replaced listener and thetok2vecpipe to the callback in order to supportspacy-curated-transformers(#12785).tqdmwithdisable=Noneto disable output in non-interactive environments (#12979).📦 Trained pipelines updates
The transformer-based
trfpipelines have been updated to use our new Curated Transformers library through the Thinc model wrappers and pipeline component from spaCy Curated Transformers.rayextra.spacy projecthas a few backwards incompatibilities due to the transition to the standalone library Weasel, which is not as tightly coupled to spaCy. Weasel produces warnings when it detects older spaCy-specific settings in your environment or project config.spacy_versionconfiguration key has been dropped.check_requirementsconfiguration key has been dropped due to the deprecation ofpkg_resources.SPACY_CONFIG_OVERRIDESenvironment variable is no longer checked. You can set configuration overrides usingWEASEL_CONFIG_OVERRIDES.SPACY_PROJECT_USE_GIT_VERSIONenvironment variable has been dropped.📖 Documentation and examples
👥 Contributors
@adrianeboyd, @bdura, @connorbrinton, @danieldk, @davidberenstein1957, @denizcodeyaa, @eltociear, @evornov, @honnibal, @ines, @jmyerston, @koaning, @magdaaniol, @pdhall99, @ringohoffman, @rmitsch, @senisioi, @shadeMe, @svlandeg, @vinbo8, @wjbmattingly
v3.6.1: : Support for Pydantic v2, find-function CLI and moreCompare Source
✨ New features and improvements
find-functionCLI for finding locations of registered functions (#12757).spacy[cuda12x]forcupy-cuda12x(#12890).init configandtrainCLI (#12173).distutilstosetuptools/sysconfig(#12853).🔴 Bug fixes
<br>tags in displaCy.📖 Documentation and examples
👥 Contributors
@adrianeboyd, @afriedman412, @arplusman, @bdura, @connorbrinton, @honnibal, @ines, @it176131, @pmbaumgartner, @rmitsch, @shadeMe, @svlandeg, @thomashacker, @victorialslocum, @x-tabdeveloping
v3.6.0: : New span finder component and pipelines for SlovenianCompare Source
✨ New features and improvements
span_finderpipeline component to identify overlapping, unlabeled spans (#12507).spacy evaluate --per-component,Language.evaluate(per_component=True)andScorer.score(per_component=True)(#12540).spancat_singlelabelinspacy debug dataCLI (#12749).PhraseMatcherandSpanGroup(#12642, #12714).🔴 Bug fixes
SpanGroupspans come from the current doc.📦 Trained pipelines updates
We have added new pipelines for Slovenian that use the trainable lemmatizer and floret vectors.
sl_core_news_smsl_core_news_mdsl_core_news_lgsl_core_news_trfThe English pipelines have been updated to improve handling of contractions with various apostrophes and to lemmatize "get" as a passive auxiliary.
The Danish pipeline
da_core_news_trfhas been updated to usevesteinn/DanskBERTwith performance improvements across the board.SpanGroupspans are now required to be from the same doc. When initializing aSpanGroup, there is a new check to verify that all added spans refer to the current doc. Without this check, it was possible to run into string store or other errors.📖 Documentation and examples
👥 Contributors
@adrianeboyd, @bdura, @danieldk, @davidberenstein1957, @diyclassics, @essenmitsosse, @honnibal, @ines, @isabelizimm, @jmyerston, @kadarakos, @KennethEnevoldsen, @khursani8, @ljvmiranda921, @rmitsch, @shadeMe, @svlandeg, @tomaarsen, @victorialslocum, @vin-ivar, @ZiadAmerr
v3.5.4: : Bug fixes for overrides with registered functions and sourced components with listenersCompare Source
✨ New features and improvements
🔴 Bug fixes
👥 Contributors
@adrianeboyd, @bdura, @honnibal, @ines, @svlandeg
v3.5.3: : Speed improvements, bug fixes and moreCompare Source
✨ New features and improvements
spancat, in particular on GPU (~10x-30x faster) (#12577).>+,>-,>++,>--) for the dependency matcher (#12528).doc.spansfor displaCy output inspacy benchmark accuracy/spacy evaluate(#12575).MorphAnalysis.get(default=)argument for user-provided default values similar todict(#12545).🔴 Bug fixes
#eggfrom download URLs due to future deprecation inpip.📖 Documentation and examples
👥 Contributors
@adrianeboyd, @andyjessen, @bdura, @davidberenstein1957, @diyclassics, @honnibal, @ines, @kadarakos, @KennethEnevoldsen, @ljvmiranda921, @moxley01, @royashcenazi, @svlandeg, @tanloong, @victorialslocum
v3.5.2: : Pretraining improvements, bug fixes for spans and spancat and moreCompare Source
✨ New features and improvements
spacy pretrain(#12435).model-last.binforspacy pretrain(#12459).Spaninput fordisplacy.parse_deps(#12477).cupyinstall extras.🔴 Bug fixes
Span.sents.spancat_singlelabel.Span.sentswhen the final sentence is the last token in aDoc.Span.kb_idandSpan.idstrings inDocandDocBinserialization.📖 Documentation and examples
👥 Contributors
@adrianeboyd, @BLKSerene, @honnibal, @ines, @kadarakos, @prajakta-1527, @rmitsch, @shadeMe, @sloev, @svlandeg, @thomashacker, @willfrey
v3.5.1: : spancat for multi-class labeling, fixes for textcat+transformers and moreCompare Source
💥 We'd love to hear more about your experience with spaCy! Take our survey here.
✨ New features and improvements
spancat_singlelabelpipeline component for multi-class and non-overlapping span classification. Thespancat_singlelabelcomponent predicts at most one label for each suggested span and adds a new settingallow_overlapto restrict the output to non-overlapping spans (#11365).transformer+ CNN for efficient GPUtextcatwithspacy init config(#11900).spacy debug data(#11419).>+,>-,<+,<-) (#12334).spacy.PlainTextCorpusReader.v1for plain text input (#12122).alignment_modeandspan_idtoSpan.char_span()(#12145, #12196).🔴 Bug fixes
top_k>1in trainable lemmatizer.test_cli_find_threshold()test more robust.registry.find().Matcherpatterns with extension attributes.grcto languages with lexeme norms inspacy-lookups-data.KnowledgeBaseinstances configurable.auto_select_port.InMemoryLookupKB.is_empty.Lexeme.orthandLexeme.lower.PretrainVectors.pkg_resources.📖 Documentation and examples
👥 Contributors
@adrianeboyd, @andyjessen, @danieldk, @essenmitsosse, @honnibal, @ines, @itssimon, @kadarakos, @kwhumphreys, @ljvmiranda921, @pmbaumgartner, @polm, @richardpaulhudson, @rmitsch, @shadeMe, @svlandeg, @tanloong, @thomashacker, @victorialslocum
v3.5.0: : New CLI commands, language updates, bug fixes and much moreCompare Source
✨ New features and improvements
applyCLI command to annotate new documents with a trained pipeline (#11376).benchmarkCLI command to benchmark pipelines. The newbenchmark speedsubcommand measures the speed of a pipeline, thebenchmark accuracysubcommand is a new alias forevaluate(#11902).find-thresholdCLI command to identify an optimal threshold for classification models (#11280).FUZZYMatcheroperator for fuzzy matches based on Levenshtein edit distance. In addition, theFUZZYandREGEXoperators are now supported in combination withIN/NOT_IN. (#11359).typerv0.7.x (#11720),mypy0.990 (#11801) andtyping_extensionsv4.4.x (#12036).spacy.ConsoleLogger.v3with expanded progress tracking (#11972).textcatwithspacy.textcat_scorer.v2(#11696 and #11971) andspacy.textcat_multilabel_scorer.v2(#11820).InMemoryLookupKB(#11268).before_updatecallback that is invoked at the start of each training step (#11739).SpanGroup(#11380).displacy.servewhen the default port is in use (#11948).tok2vecversion (#11618).🔴 Bug fixes
tok2vecortransformerlayer.textcat.Vocab.to_diskrespects the exclude setting forlookupsandvectors.SpanGroupandSpanobjects.The following changes may require you to update code that is using the relevant functionality:
textcatortextcat_multilabelmodel - ensure that values are 0.0 or 1.0 as explained in the docs.KnowledgeBaseis now an abstract class, you should call the constructor of the newInMemoryLookupKBinstead when you want to use spaCy's default KB implementation. If you've written a custom KB that inherits fromKnowledgeBase, you'll need to implement its abstract methods, or alternatively inherit fromInMemoryLookupKBinstead.The following changes may influence the output of your language pipeline or trained models:
pymorphy3(#11345, #11811).tok2vecdefaults in all components (#11618).textcatandtextcat_multilabelcomponents (#11698).textcatandtextcat_multilabelto fix a bug related tothresholdfortextcatand to make it possible to score multipletextcat/textcat_multilabelcomponents in a single pipeline with custom scorers. If no custom scorers are used, thecat_p/r/fscores will now only reflect the final component's labels and performance (#11696, #11820).token_accscore to report the intended measure (# correct tokens / # predicted tokens, the same as in spaCy v2). Thetoken_accscores for v3.5 will be lower for the same performance because they were incorrectly inflated in v3.0-v3.4. Thetoken_p/r/fscores should remain unchanged (#12073).The following functionality will be changed in the near future - so it's best to start updating your scripts now to make them more generic:
masterbranch tomain.📦 Trained pipelines updates
IS_SPACEas atok2vecfeature fortaggerandmorphologizercomponents to improve tagging of non-whitespace vs. whitespace tokens.spacy-transformersv1.2, which uses the exact alignment fromtokenizersfor fast tokenizers instead of the heuristic alignment fromspacy-alignments. For all trained pipelines exceptja_core_news_trf, the alignments between spaCy tokens and transformer tokens may be slightly different. More details about thespacy-transformerschanges in the v1.2.0 release notes.📖 Documentation and examples
biluo_to_iobandiob_to_biluofunctions.👥 Contributors
@aaronzipp, @adrianeboyd, @albertvillanova, @ArchiDevil, @cfuerbachersparks, @damian-romero, @danieldk, @darigovresearch, @DSLituiev, @essenmitsosse, @gremur, @honnibal, @ines, @jmyerston, @JosPolfliet, @kadarakos, @koaning, @kwhumphreys, @ljvmiranda921, @MarcoGorelli, @orglce, @pmbaumgartner, @polm, @richardpaulhudson, @rmitsch, @ryndaniels, @shadeMe, @svlandeg, @thomashacker, @TrellixVulnTeam, @wannaphong, @zhiiw, @zrpxx
v3.4.4: : Bug fixes and future NumPy compatibilityCompare Source
This bug fix release is primarily to avoid deprecation warnings and future incompatibility with NumPy v1.24+.
🔴 Bug fixes
spancatfor docs with zero suggestions.smart_openrequirement and update deprecated options.spacy init config --gpufor environments withoutspacy-transformers.EditTreeLemmatizer.👥 Contributors
@adrianeboyd, @danieldk, @honnibal, @ines, @polm, @svlandeg
v3.4.3: : Extended Typer support and bug fixesCompare Source
✨ New features and improvements
🔴 Bug fixes
EntityLinker.Doc.to_json()for attributes set by getters.pipeline_package.load().spacy projectrequirements checks for unsupported specifiers and requirements lines.spacy.load(disable=)that could enable currently disabled components.👥 Contributors
@aaronzipp, @adrianeboyd, @honnibal, @ines, @polm, @rmitsch, @ryndaniels, @svlandeg, @thomashacker
v3.4.2: : Latin and Luganda support, Python 3.11 wheels and moreCompare Source
✨ New features and improvements
spacy.ConsoleLogger.v2optionally saves training logs to JSONL (#11214).DependencyMatcherto include matching parents or children to the left or the right of the node (#10371).cuda11xandcuda-autodetect(usingcupy-wheel) (#11279).Doc.to_json()andDoc.from_json()(#11125).enableanddisableoptions forspacy.load()more consistent (#11459).disable/enclude/excludeforspacy.load()(#11406).--urlflag forspacy infoto print the direct download URL for a pipeline (#11175).spacy projectCLI (#11226).spacy debug dataCLI for spancat data (#11504).spacy_versioninspacy packagemetadata (#11552).spacy project assets(#11458).spacy pretraincommand (#11210).natto-pyfor thekoextra (#11222).📦 Trained pipelines updates
This release includes updated English pipelines for spaCy v3.4 with improved NER performance. The updates in
en_core_web_*v3.4.1 address issues related to training from data with partial named entity annotation, which led to lower NER recall in English pipeline versions v3.0.0–v3.4.0. In particular, entities that appear in the sections of the OntoNotes training data without NER annotation were not predicted consistently by the earlier pipeline versions, such as names and places that are frequent in the Biblical sections, e.g., "David" and "Egypt" (see #7493).Use
spacy downloadto update your English pipelines to the newest version. If you'd prefer to keep using an earlier version, you can specify the version directly with e.g.spacy download -d en_core_web_sm-3.4.0. You can check that you are using the new version (v3.4.1) withspacy validate:🔴 Bug fixes
SetPredicate.Doc.__init__.pymorphy2_lookuplemmatizer mode for Russian and Ukrainian.Doctype, an error will now be raised (#11424).📖 Documentation and examples
spacy.models_and_pipes_with_nvtx_range.v1callback.ExampleAPI documentation.displacydocs.spacy project dvc.spacy-wordnet.initialize()function for pipeline components.Configuration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR was generated by Mend Renovate. View the repository job log.