For text classification, accuracy shows overall correct predictions. But because some categories may be rare, precision and recall are very important.
Precision tells us how many documents labeled as a category truly belong there. This avoids false alarms.
Recall tells us how many documents of a category were found by the model. This avoids missing important documents.
F1 score balances precision and recall, giving a single number to compare models.