Model Pipeline - Information extraction patterns
This pipeline extracts useful facts from text using patterns. It finds specific information like names, dates, or places by matching text shapes and words.
Jump into concepts and practice - no test required
This pipeline extracts useful facts from text using patterns. It finds specific information like names, dates, or places by matching text shapes and words.
Loss
1.0 |****
0.8 |****
0.6 |****
0.4 |****
0.2 |****
0.0 +----
1 2 3 4 5 Epochs
| Epoch | Loss ↓ | Accuracy ↑ | Observation |
|---|---|---|---|
| 1 | 0.85 | 0.60 | Model starts learning basic patterns, accuracy is moderate. |
| 2 | 0.60 | 0.75 | Loss decreases as model better recognizes entities. |
| 3 | 0.45 | 0.82 | Model improves entity extraction accuracy. |
| 4 | 0.35 | 0.88 | Loss continues to decrease, accuracy nearing good performance. |
| 5 | 0.28 | 0.91 | Model converges with high accuracy on pattern extraction. |
\b\d{4}-\d{2}-\d{2}\b matches a 4-digit year, 2-digit month, and 2-digit day separated by dashes, which is a common date format.\d+\s+\w+ (matches any number followed by a word) matches number + word but is too general; C matches emails; A matches uppercase words, not dates.\b\d{4}-\d{2}-\d{2}\b (matches YYYY-MM-DD format) [OK]\b(Mr|Ms|Dr)\.\s+[A-Z][a-z]+\b, what will be the output when applied to the text: "Dr. Smith and Mr. Johnson went to the park."?\b[\w.-]+@[\w.-]+\b\b[A-Z][a-z]+\s+[A-Z]{2}\b matches a capitalized word, a space, then exactly two uppercase letters, fitting the example.\b[A-Z][a-z]+\s+[A-Z]{2}\b (capitalized city name + space + two uppercase letters) [OK]