Which of the following best describes the primary function of Apache Ranger in a Hadoop ecosystem?
Think about what controls who can do what with data in Hadoop.
Apache Ranger centralizes authorization by managing access control policies across Hadoop components, ensuring users have the right permissions.
Given a Ranger policy that allows user 'alice' to read data in HDFS path '/data/reports' but denies write access, what will be the result of the following action?
Action: User 'alice' attempts to write a file to '/data/reports/2024_report.csv'
Read and write permissions are different; check what is explicitly allowed or denied.
Apache Ranger enforces explicit permissions. Since 'alice' only has read permission, write attempts are denied.
Review the following Ranger audit log snippet and identify why user 'bob' was denied access:
{
"user": "bob",
"resource": "/data/sales",
"accessType": "read",
"result": "DENIED",
"policyId": 102,
"reason": "No matching allow policy found"
}Focus on the 'reason' field in the log.
The log states 'No matching allow policy found', meaning 'bob' lacks explicit permission for the resource.
You want to create a Ranger policy that allows the group 'data_scientists' to read and write to the HDFS directory '/projects/analytics'. Which of the following policy configurations will achieve this?
Check which permissions and resource path match the requirement.
Assigning 'read' and 'write' permissions to the 'data_scientists' group on the exact resource '/projects/analytics' allows the desired access.
You have a bar chart showing the number of access requests by users over a week. The chart shows user 'carol' with a high number of denied requests compared to others. What is the most likely interpretation?
High denied requests usually mean permission issues or unauthorized access attempts.
A high number of denied requests for a user typically indicates they are trying to access resources without the needed permissions, which could be a misconfiguration or security concern.