Category: EBS Administration

Oracle E-Business Suite administration, WebLogic and application tier guides

  • Oracle EBS Login Issue: Real-Time Production Incident and Fix (OACORE + WebLogic Filter)


    📌 Introduction

    In Oracle E-Business Suite (EBS) environments, login failures are often perceived as simple application issues. However, in complex architectures, they can originate from multiple interacting layers across the application and middleware stack.

    In this blog, I’ll walk through a real-world production incident where an OACORE JVM issue combined with WebLogic security filtering resulted in complete login inaccessibility.

    This case highlights the importance of analyzing both performance and security layers together when troubleshooting critical application outages.


    ⚠️ Issue Summary

    • Users were unable to access the EBS login page
    • Pages were hanging or not loading
    • WebLogic console reported:
    Connection rejected, filter blocked Socket

    🔍 Initial Observation

    From the application server:

    • Load Average: ~10+

    👉 This indicated:

    • High CPU utilization
    • System under heavy stress
    • Potential JVM thread contention

    🔬 Detailed Analysis

    • One of the OACORE managed server JVMs became unresponsive
    • Long-running threads caused thread pool exhaustion
    • Incoming user requests began queueing

    At the same time:

    • WebLogic connection filter was actively enforcing access rules
    • Legitimate requests were being rejected under stressed conditions

    🧠 Understanding the Components

    OACORE (Application Layer)

    Handles:

    • Login requests
    • Forms processing
    • Core application logic

    If JVM threads are exhausted:
    👉 Requests queue → login hangs
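
    A quick way to confirm thread exhaustion is a thread dump of the suspect JVM. A minimal sketch, assuming a JDK is available on the application tier and the command is run as the JVM owner (WebLogic's stuck-thread naming can vary by version):

    # Capture a thread dump of the suspect OACORE JVM
    jstack -l <PID> > /tmp/oacore_threads.txt

    # WebLogic renames threads it considers stuck; count them
    grep -c "STUCK" /tmp/oacore_threads.txt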


    Oracle WebLogic Server Connection Filter

    EBS environments may use:

    oracle.apps.ad.tools.configuration.wls.filter.EBSConnectionFilterImpl

    This filter:

    • Enforces IP-based access control
    • Overrides default allow rules

    If misconfigured or stressed:
    👉 Legitimate traffic may be blocked
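
    For reference, WebLogic connection filter rules follow the pattern target localAddress localPort action protocols and are evaluated top-down. A hedged illustration (the addresses below are placeholders, not values from this incident):

    # Allow the internal application subnet over common protocols
    192.168.10.0/24 * * allow t3 t3s http https
    # Deny everything else
    0.0.0.0/0 * * deny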


    🎯 Root Cause Analysis (RCA)

    The login issue was not caused by a single failure point, but by a combination of application tier resource exhaustion and restrictive middleware-level access control.

    • High CPU utilization and long-running threads caused one OACORE JVM to become unresponsive
    • Thread pool exhaustion led to request queuing, preventing new login requests from being processed

    Simultaneously:

    • The WebLogic connection filter (EBSConnectionFilterImpl) enforced strict access control policies
    • Under high load conditions, legitimate client requests were rejected with “filter blocked Socket”

    This interaction between performance degradation and security enforcement amplified the impact, resulting in complete login inaccessibility despite partial system availability.


    🛠️ Resolution Approach (Controlled & Safe)

    The resolution approach focused on stabilizing the JVM layer while validating and correcting middleware-level access controls in a controlled manner.


    🔹 Step 1: Identify Unresponsive JVM

    ps -ef | grep oacore

    ✔ Identify JVM with abnormal CPU or stuck behavior
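
    On Linux, the ps output can be sorted by CPU directly; a small sketch (the grep pattern assumes default OACORE managed server naming):

    # List oacore JVMs sorted by CPU, with elapsed run time
    ps -eo pid,pcpu,pmem,etime,args --sort=-pcpu | grep -i oacore | grep -v grep

    # Cross-check the overall load average
    uptime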


    🔹 Step 2: Handle Stuck JVM (Controlled Action)

    ⚠️ Important Note:

    Forcefully terminating JVM processes should NOT be performed without validation.

    ✔ Recommended Approach:

    • Confirm the process is unresponsive
    • Ensure no critical transactions are running
    • Prefer controlled shutdown where possible

    ✔ Example (Only if fully unresponsive and approved):

    kill -9 <PID>

    👉 Node Manager can restart the JVM automatically after termination (provided the managed server was started under Node Manager with auto-restart enabled)
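
    After an approved termination, confirm that Node Manager actually brought the JVM back. A minimal check, assuming EBS R12.2 defaults where $EBS_DOMAIN_HOME points to the WebLogic domain home:

    # A new oacore JVM should appear (PID must differ from the killed one)
    ps -ef | grep oacore_server1 | grep -v grep

    # Watch the managed server log for a clean startup
    tail -f $EBS_DOMAIN_HOME/servers/oacore_server1/logs/oacore_server1.out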


    🔹 Step 3: Rolling Restart of OACORE

    admanagedsrvctl.sh stop oacore_server1
    admanagedsrvctl.sh start oacore_server1

    ✔ Ensures clean JVM state
    ✔ Restores thread pool balance
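
    Where multiple OACORE managed servers exist, restarting them one at a time lets the surviving JVMs keep serving logins. A minimal sketch, assuming two managed servers and the standard $ADMIN_SCRIPTS_HOME location (server names and the warm-up pause are assumptions to adapt):

    cd $ADMIN_SCRIPTS_HOME
    for srv in oacore_server1 oacore_server2; do
        ./admanagedsrvctl.sh stop $srv
        ./admanagedsrvctl.sh start $srv
        # Let the restarted JVM warm up before cycling the next one
        sleep 120
    done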


    🔹 Step 4: Validate WebLogic Connection Filter

    ⚠️ Important Note:

    The WebLogic connection filter is a security control and should NOT be disabled permanently.

    ✔ In this case:

    • Filter behavior was validated as part of troubleshooting
    • Temporary relaxation was used to confirm impact

    ✔ Recommended Approach:

    • Review allowed IP ranges
    • Validate filter configuration
    • Re-enable filter after correction

    The filter was re-enabled after validation and correction of configuration.
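
    The filter class and its rules are recorded in the domain configuration, so they can be reviewed outside the console as well. A hedged sketch, assuming the EBS R12.2 $EBS_DOMAIN_HOME variable:

    # Inspect the connection filter definition and rules in the domain config
    grep -i -A3 "connection-filter" $EBS_DOMAIN_HOME/config/config.xml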


    🔹 Step 5: Restart Admin Server (If Required)

    adadminsrvctl.sh stop
    adadminsrvctl.sh start

    ✔ Ensures configuration changes are applied


    ✅ Final Outcome

    • System load reduced from ~10 → ~1.5
    • OACORE JVMs stabilized
    • Login page restored
    • Users successfully accessed the application

    📊 Key Learnings

    1. OACORE JVM issues can manifest as complete application outages due to thread exhaustion and request queuing.
    2. WebLogic connection filters can unintentionally block legitimate traffic, especially under high load conditions.
    3. Multi-layer failures (application + middleware + security) significantly amplify incident impact.
    4. System load metrics (CPU, load average) provide early indicators of JVM stress and should be monitored proactively.

    🧩 Preventive Measures

    • Monitor JVM thread utilization proactively
    • Review connection filter configurations periodically
    • Schedule bulk concurrent workloads away from peak login hours to avoid load spikes
    • Implement alerting for high CPU / load conditions (see the sketch below)
    • Validate session and request handling patterns
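
    As one example of such alerting, here is a minimal cron-driven sketch that mails when the 1-minute load average crosses a threshold. The threshold, recipient, and paths are assumptions to adapt, and it assumes a Linux host with bc and mailx available:

    #!/bin/bash
    # load_alert.sh - alert when the 1-minute load average exceeds THRESHOLD
    THRESHOLD=8
    LOAD=$(awk '{print $1}' /proc/loadavg)
    if [ "$(echo "$LOAD > $THRESHOLD" | bc)" -eq 1 ]; then
        echo "High load on $(hostname): $LOAD" | mailx -s "EBS app tier load alert" dba-team@example.com
    fi
    # Crontab entry (goes in the crontab, not in this script) - every 5 minutes:
    # */5 * * * * /path/to/load_alert.sh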

    🔐 Governance Considerations

    • Avoid forceful JVM termination without proper validation
    • Do not disable security controls without understanding impact
    • Follow change management procedures in production
    • Coordinate with application and security teams before changes

    🏁 Conclusion

    This incident demonstrates that login failures in Oracle EBS are not always isolated to a single component but can result from complex interactions across application performance and middleware security layers.

    The combination of JVM resource exhaustion and connection filtering behavior created a compounded failure scenario, leading to complete login disruption.

    A structured, multi-layer troubleshooting approach—focused on performance, configuration, and governance—enabled effective resolution while minimizing risk.

    This reinforces the importance of analyzing both system behavior and security controls together when addressing critical production incidents.


    💡 Pro Tip

    When troubleshooting Oracle EBS login issues, always validate both:

    • JVM health (thread utilization, CPU load)
    • Middleware controls (connection filters, access rules)

    Ignoring either layer can lead to incomplete or misleading diagnosis.

  • Oracle EBS OACORE Server in FAILED_NOT_RESTARTABLE State: Real-Time Issue, RCA and Fix

    In Oracle E-Business Suite (EBS) environments, application tier stability is critical to ensure seamless user experience. However, there are scenarios where managed servers behave unexpectedly and require manual intervention. This post walks through a real-world production issue where an OACORE managed server entered a FAILED_NOT_RESTARTABLE state, its impact, root cause analysis, and how it was resolved.


    Environment Details

    • Oracle E-Business Suite: R12.2.x
    • Application Tier: WebLogic Managed Servers
    • Component Impacted: OACORE Server (oacore_server1)
    • Environment Type: Production

    Problem Statement

    An alert was received indicating oacore_server1 was in FAILED_NOT_RESTARTABLE state. Upon verification, the managed server had failed and Node Manager could not auto-restart it.


    Key Observations

    Despite the OACORE server being in a failed state, the application remained accessible and functional — traffic was being handled by other OACORE servers. This is due to the multi-OACORE architecture with load balancing via OHS/Web tier. However, this creates a hidden risk: load redistribution increases pressure on remaining servers and can lead to cascading failures if not addressed promptly.


    Detailed Analysis

    • Managed server restart attempts failed during initialization
    • Bulk concurrent requests were actively running
    • CPU utilization spiked on the application tier
    • JVM resources were under pressure

    Understanding FAILED_NOT_RESTARTABLE

    In Oracle WebLogic Server, a managed server is marked as FAILED_NOT_RESTARTABLE after repeated unsuccessful restart attempts. This is a protective mechanism designed to prevent unstable restart loops when the server cannot recover successfully.
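
    Node Manager records this state on disk, which is useful when the console is unreachable. A hedged check, assuming EBS R12.2 where $EBS_DOMAIN_HOME is the WebLogic domain home:

    # Node Manager's recorded state for the managed server
    cat $EBS_DOMAIN_HOME/servers/oacore_server1/data/nodemanager/oacore_server1.state

    # Review recent startup attempts for the failure reason
    tail -100 $EBS_DOMAIN_HOME/servers/oacore_server1/logs/oacore_server1.out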


    Root Cause Analysis

    The OACORE managed server entered FAILED_NOT_RESTARTABLE state due to repeated startup failures following an unclean or resource-constrained shutdown. High CPU utilization and heavy concurrent workload placed JVM resources under pressure, preventing a clean restart cycle. Residual runtime artifacts (such as incomplete shutdown state or resource locks) prevented successful reinitialization, causing WebLogic to mark the server as FAILED_NOT_RESTARTABLE.


    Resolution

    cd $ADMIN_SCRIPTS_HOME
    ./admanagedsrvctl.sh stop oacore_server1
    ./admanagedsrvctl.sh start oacore_server1

    After the controlled restart, the server returned to RUNNING state with all deployments active and the application stable.


    Identify Inactive Forms Sessions

    Inactive sessions holding resources can contribute to JVM pressure. Use this query to identify them safely — do not terminate without proper validation and approvals:

    SELECT s.sid,
           s.serial#,
           s.username,
           s.status,
           s.program,
           s.machine,
           ROUND(s.last_call_et/3600,2) AS hours_inactive
    FROM v$session s
    WHERE s.status = 'INACTIVE'
    AND s.username = 'APPS'
    AND s.program LIKE 'frmweb%'
    AND s.last_call_et > 28800   -- 8 hours
    ORDER BY hours_inactive DESC;

    Reference only — do NOT execute without validation:

    ALTER SYSTEM KILL SESSION 'SID,SERIAL#' IMMEDIATE;

    Automate Session Monitoring

    Use this script to monitor inactive sessions every 8 hours via cron:

    #!/bin/bash
    # inactive_sessions.sh - count APPS frmweb sessions idle for more than 8 hours
    export ORACLE_SID=your_sid
    export ORACLE_HOME=/path/to/oracle_home
    export PATH=$ORACLE_HOME/bin:$PATH

    sqlplus -s "/ as sysdba" <<EOF
    SET LINES 200
    SET PAGES 200
    SELECT COUNT(*) AS inactive_sessions
    FROM v\$session
    WHERE status='INACTIVE'
    AND username='APPS'
    AND program LIKE 'frmweb%'
    AND last_call_et > 28800;
    EXIT;
    EOF

    Crontab entry (add this to the crontab, not to the script) - every 8 hours:

    0 */8 * * * /path/to/inactive_sessions.sh >> /tmp/inactive_sessions.log 2>&1

    DBA Quick Commands

    # Check system load
    top
    uptime
    ps -ef | grep oacore

    -- Check running concurrent requests (phase_code 'R' = Running)
    SELECT request_id, phase_code, status_code
    FROM fnd_concurrent_requests
    WHERE phase_code = 'R';
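
    To see which programs those running requests belong to, the same query can be extended with the standard FND program tables (a sketch; join columns follow the seeded EBS data model):

    SELECT r.request_id,
           p.user_concurrent_program_name,
           r.phase_code,
           r.status_code
    FROM fnd_concurrent_requests r,
         fnd_concurrent_programs_tl p
    WHERE r.concurrent_program_id = p.concurrent_program_id
    AND r.program_application_id = p.application_id
    AND p.language = 'US'
    AND r.phase_code = 'R'   -- R = Running
    ORDER BY r.request_id;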

    Key Takeaways

    • Application may appear healthy even when an OACORE server fails due to load balancing
    • FAILED_NOT_RESTARTABLE is a protective mechanism, not the root cause itself
    • Resource pressure and restart failures must be analyzed together
    • Controlled and governed actions are critical in production environments
    • Proactive session monitoring via automation helps prevent recurrence

    Have questions or faced a similar issue? Reach out at sdanwarahmed@gmail.com.