Industrial-strength Natural Language Processing (NLP) with Python and Cython
Questions tagged [spacy]
87 questions
6 votes
1 answer
Address parsing using spaCy
I am trying to parse addresses from various documents with spaCy's NER, but the results are not very accurate.
I know this is a bit of a generic question, but it would be a great help if I could get references to any past work or good articles or…
SKB
6 votes
1 answer
Is there any way to define custom entities in spaCy?
1) I have just started working on NLP; the basic idea is to extract meaningful information from text. For this I am using spaCy.
As far as I have studied, spaCy has the following entities:
ORG
PERSON
DATE
MONEY
CARDINAL
etc. But I want to add custom…
AddyProg
5 votes
2 answers
Converting paragraphs into sentences
I'm looking for ways to extract sentences from paragraphs of text containing different types of punctuation. I used spaCy's Sentencizer to begin with.
Sample input (a Python list, abstracts):
["A total of 2337 articles were found, and, according…
Van Peer
5 votes
3 answers
Chunking Sentences with spaCy
I have a lot of sentences (500k) which look like this:
"Penalty missed! Bad penalty by Felipe Brisola - Riga FC - shot with right foot is very close to the goal. Felipe Brisola should be disappointed."
"Penalty saved! Damir Kojasevic - Sutjeska…
senty
5 votes
1 answer
How to train a spaCy model for text classification?
What are the steps to train a spaCy model for text classification (binary classification in my case)?
Please help me with the process and how to approach it.
krishna rao gadde
5 votes
1 answer
spaCy custom POS tagging for medical concepts
We are a group of doctors trying to use the linguistic features of spaCy, especially part-of-speech tagging, to show relationships between medical concepts like:
'Femoral artery pseudoaneurysm' as in ==>
"femoral artery" ['Anatomical Location'] …
Sobath Premaratne
5 votes
1 answer
Meaning of NER Training values using spaCy
I am trying to train custom entities using spaCy. During training I get a number of values such as LOSS, score, etc. What do these values mean?
============================= Training pipeline =============================
ℹ…
Aniiya0978
4 votes
2 answers
Does spaCy support multiple GPUs?
I was wondering if spaCy supports multi-GPU via mpi4py?
I am currently using spaCy's nlp.pipe for Named Entity Recognition on a high-performance-computing cluster that supports the MPI protocol and has many GPUs. It says here that I would need to…
Jinhua Wang
3 votes
2 answers
How to make spaCy 3.0 custom NER model training optimize for precision rather than recall?
My current custom NER model is good on recall, but I want to focus on improving precision. Is it possible to change the optimizer metric in the config file?
alias_neo92
3 votes
1 answer
Testing spaCy NER model
I've trained an NER model with spaCy, and I would like to test its accuracy on a test dataset. What would be the best way to do this?
Adnos
3 votes
0 answers
Medical NER for French language
I'm currently exploring options for medical NER specifically for the French language. I tried spaCy's general French NER, but it wasn't helpful (mainly because of the domain-specific requirements). I assume we cannot use Med7 or…
Van Peer
3 votes
2 answers
Is NLP suitable for my legal contract parsing problem?
My company has a product that involves extracting a variety of fields from legal contract PDFs. The current approach is very time-consuming and messy, and I am exploring whether NLP is a suitable alternative.
The PDFs that need to be parsed…
Posionus
3 votes
1 answer
Smart sentence segmentation not splitting on abbreviations
The sentence segmenters from spaCy and NLTK do not recognize typical abbreviations (e.g. Mio. for Million in German), so the resulting sentence split is not correct. I understand that sentence segmenters are supposed to be simple and quick, but I am wondering…
sophros
3 votes
1 answer
Character-level embeddings in Python
I'm working on an NLP task that requires character-level embeddings, and I've been trying to use spaCy. However, it seems that spaCy uses word-level embeddings for its word vectors, and I need character-level embeddings. The only…
rmaguiar
3 votes
0 answers
Change the way spaCy works - custom properties for training and prediction
spaCy detects entities using its predefined algorithm. It parses the tokens in a text, considering the position of each token relative to the tokens surrounding it. It also takes the POS tags of these tokens into consideration.
However, I believe it…
Sandeep Bhutani