| Document Reference: | MUST-IT-POL-002 |
| Version: | 1.0 |
| Effective Date: | 2025 |
| Review Date: | Annually |
| Classification: | Internal – Restricted |
| Issued By: | Information Technology Department |
| Approved By: | Chief of Information Technology |
1. PURPOSE
This policy establishes the requirements and procedures for maintaining redundant internet connectivity at Misr University for Science and Technology (MUST). The university relies heavily on internet access for academic instruction, research, administrative operations, cloud-based services, and e-learning platforms. Unplanned internet outages can severely disrupt core university functions. This policy ensures that robust, resilient, and continuously available internet connectivity is maintained across all campuses and facilities.
2. SCOPE
This policy applies to:
- All campus buildings, facilities, and remote sites operated or leased by MUST
- All IT infrastructure components that depend on internet connectivity
- The IT Department and all staff responsible for network operations
- Third-party Internet Service Providers (ISPs) and managed service vendors contracted by MUST
- Any department or unit planning to procure or modify internet connectivity services
3. DEFINITIONS
| Term | Definition |
| Redundant Connectivity | The provision of two or more independent internet connections so that if one fails, traffic automatically fails over to the other with minimal disruption. |
| Primary Link | The principal internet connection used under normal operating conditions. |
| Secondary Link (Failover) | A backup internet connection activated automatically or manually when the primary link fails or degrades. |
| Tertiary Link | An additional backup connection providing a third layer of resilience for mission-critical sites. |
| ISP (Internet Service Provider) | A company that provides internet access services to the university. |
| BGP (Border Gateway Protocol) | A routing protocol used to exchange routing information between multiple ISPs, enabling automatic failover. |
| SLA (Service Level Agreement) | A formal contract between MUST and an ISP specifying uptime guarantees, response times, and penalties. |
| Failover | The process of automatically switching to a redundant network connection when the primary connection fails. |
| Uptime | The percentage of time a network connection is available and operational. |
| MPLS | Multi-Protocol Label Switching – a high-performance networking technology used for dedicated WAN links. |
| SD-WAN | Software-Defined Wide Area Network – technology that allows intelligent traffic routing across multiple WAN links. |
4.POLICY OBJECTIVES
MUST is committed to the following connectivity objectives:
- Achieve a minimum internet uptime of 9% across all campus sites
- Ensure that no single point of failure can cause a complete loss of internet access
- Limit unplanned internet outage impact to critical systems to no more than 15 minutes
- Maintain active contracts with a minimum of two independent ISPs at all primary campuses
- Ensure seamless failover with no manual intervention required for Tier 1 sites
- Support all cloud services, LMS platforms, VoIP, video conferencing, and research networks with adequate bandwidth
- Comply with all applicable telecommunications regulations in the Arab Republic of Egypt
5. CONNECTIVITY ARCHITECTURE
5.1 Tiered Site Classification
All MUST sites are classified into three tiers based on operational criticality:
| Tier | Description | Minimum Links | Failover Requirement |
| Tier 1 – Critical | Main campus, data center, administration HQ | 3 links (Primary + 2 backups) | Automatic (<5 sec) |
| Tier 2 – Operational | Faculty buildings, labs, libraries | 2 links (Primary + 1 backup) | Automatic (<30 sec) |
| Tier 3 – Standard | Remote offices, residential facilities | 2 links (Primary + 1 backup) |
5.2 Primary Internet Connection Requirements
- Minimum bandwidth: 1 Gbps for main campus; 500 Mbps for faculty buildings; 100 Mbps for remote sites
- Fiber optic medium preferred; MPLS or dedicated leased lines acceptable
- Connection must terminate at the university’s core router via a separate physical path
- ISP must provide a guaranteed SLA with minimum 9% uptime and 4-hour fault resolution
5.3 Secondary (Failover) Connection Requirements
- Must be provisioned through a different ISP from the primary link
- Must use a physically diverse path (different conduit, exchange point, or medium)
- Minimum bandwidth: 50% of primary link capacity
- Must be live and monitored at all times – not dormant or cold-standby
- Failover must be tested quarterly
5.4 Tertiary Connection (Tier 1 Sites Only)
- May use 4G/5G mobile broadband as a tertiary emergency link
- Intended for emergency access only – not for general traffic load
- Must support critical services: email, VPN, ERP, and emergency communications
6. FAILOVER & TRAFFIC ROUTING
6.1 Automatic Failover
All Tier 1 and Tier 2 sites must be configured with automatic failover using one or more of the following technologies:
- BGP (Border Gateway Protocol) for multi-homing across multiple ISPs
- SD-WAN with health-based routing policies that detect packet loss, latency, or jitter above thresholds
- Hot-standby routing with sub-5-second detection and switchover for critical services
6.2 Routing Policies
- Primary link carries all production traffic under normal conditions
- Secondary link is active but in standby routing mode; it receives all traffic upon primary failure
- Load balancing across primary and secondary links is permitted for bandwidth optimization at Tier 1 sites
- QoS (Quality of Service) policies must be configured to prioritize: VoIP, video conferencing, LMS, and ERP traffic over secondary links
6.3 DNS Redundancy
- MUST must operate at minimum two DNS servers hosted with different providers
- DNS TTL (Time to Live) values must be configured appropriately to support rapid failover
- External DNS must resolve correctly regardless of which ISP link is active
7. ISP SELECTION & CONTRACT MANAGEMENT
7.1 ISP Selection Criteria
All ISPs contracted by MUST must meet the following minimum criteria:
| Criterion | Minimum Requirement |
| Uptime SLA | 99.9% or higher (8.76 hours maximum downtime per year) |
| Mean Time to Repair (MTTR) | 4 hours or less for critical faults |
| Bandwidth Scalability | Ability to upgrade bandwidth within 30 days |
| Peering & Routing | Tier-1 or Tier-2 ISP with diverse peering points |
| Support Coverage | 24/7 NOC (Network Operations Center) with dedicated account management |
| Geographic Redundancy | Different physical infrastructure from competing ISP |
| Security | DDoS mitigation, BGP route filtering, abuse management |
7.2 Contract Requirements
- All ISP contracts must be reviewed and approved by the IT Director prior to signing
- Contracts must specify bandwidth guarantees, SLA penalties for downtime, escalation paths, and notice periods
- A minimum contract duration of 12 months is required for primary links
- SLA performance reports must be submitted by ISPs monthly
- Contracts must include a right-to-audit clause allowing MUST to verify performance data
7.3 ISP Performance Review
- ISP performance is reviewed quarterly by the IT Director
- Any ISP failing to meet SLA three consecutive months is subject to contract review or termination
- Alternative ISPs must be evaluated annually to ensure competitive pricing and technology currency
8. MONITORING & ALERTING
8.1 Continuous Monitoring
- All internet links must be monitored 24/7 using an automated Network Management System (NMS)
- Monitored parameters include: link status (up/down), latency, packet loss, jitter, and throughput utilization
- Monitoring probes must be sent every 30 seconds or less on all active links
- Historical performance data must be retained for a minimum of 12 months
8.2 Alerting Thresholds
| Metric | Warning Threshold | Critical Threshold |
| Link Availability | < 99.9% in rolling 30 days | < 99.5% in rolling 30 days |
| Latency (to ISP gateway) | > 20 ms | > 50 ms |
| Packet Loss | > 0.5% | > 1% |
| Bandwidth Utilization | > 75% sustained | > 90% sustained |
| Failover Activation | Any failover event | Failover lasting > 15 min |
8.3 Alert Escalation
- Warning alerts: Notify on-call IT engineer via email and SMS
- Critical alerts: Notify IT Director, Network Engineer, and NOC team simultaneously
- Unresolved critical alerts after 30 minutes: Escalate to IT Director and VP for Operations
9. TESTING & VALIDATION
The following tests must be conducted to validate the redundancy capability:
| Test Type | Frequency | Owner | Documentation |
| Failover test (secondary link activation) | Quarterly | Network Engineer | Required |
| Full traffic switchover drill | Semi-annually | IT Director | Required |
| ISP SLA compliance audit | Quarterly | IT Director | Required |
| Bandwidth capacity test | Semi-annually | Network Engineer | Required |
| Emergency tertiary link test (Tier 1) | Annually | Systems Admin | Required |
| DNS failover validation | Quarterly | Network Engineer | Required |
10. SECURITY REQUIREMENTS
- All internet-facing connections must be protected by a next-generation firewall (NGFW) with active threat intelligence
- BGP peers must be authenticated using MD5 or TCP-AO to prevent route hijacking
- DDoS protection must be active on all primary and secondary links
- Network traffic must be monitored for anomalies using an Intrusion Detection/Prevention System (IDS/IPS)
- All internet connections must pass through centralized security inspection – split tunneling without security inspection is prohibited
- Changes to BGP routing policies or ISP configurations require change management approval
11. INCIDENT MANAGEMENT
11.1 Incident Classification
| Severity | Definition | Response Time |
| P1 – Critical | Complete loss of all internet links on any campus | Immediate – 15 min |
| P2 – High | Primary link failure; failover active | 30 minutes |
| P3 – Medium | Degraded performance on primary link | 2 hours |
| P4 – Low | Minor intermittent issues; no user impact | Next business day |
11.2 Incident Response Steps
- Detection: Alert triggered by NMS or reported by user
- Triage: Network Engineer assesses severity within 5 minutes
- Escalation: P1/P2 incidents escalated to IT Director immediately
- ISP Engagement: ISP contacted within 15 minutes for P1 and P2 faults
- Stakeholder Notification: Affected departments notified within 30 minutes for P1 incidents
- Resolution & Documentation: Incident resolved; root cause analysis completed within 48 hours
12. ROLES & RESPONSIBILITIES
| Role | Responsibility |
| IT Director | Policy owner; approves ISP contracts; reviews SLA performance; authorizes architecture changes |
| Network Engineer | Configures and maintains all internet links; performs failover tests; responds to P1/P2 incidents |
| Systems Administrator | Monitors NMS alerts; manages DNS; supports failover validation |
| IT Security Officer | Ensures security controls on all internet links; approves BGP and firewall changes |
| University VP (Operations) | Executive escalation contact for prolonged outages; approves emergency procurement |
| Department Heads | Report connectivity issues; communicate outages to their teams; participate in DR exercises |
| ISP Account Managers | Provide monthly SLA reports; respond to incidents per contract; support capacity planning |
13. CAPACITY PLANNING
- Bandwidth utilization on all links must be reviewed monthly
- Link upgrades must be initiated when sustained utilization exceeds 70% over any 30-day period
- An annual capacity plan must be submitted to university management each October
- Future bandwidth projections must account for growth in student enrolment, online learning adoption, and cloud migration
- Emergency bandwidth procurement procedures must be documented and approved in advance
14. COMPLIANCE & REGULATORY REQUIREMENTS
- MUST shall comply with all telecommunications regulations issued by the National Telecommunications Regulatory Authority (NTRA) of Egypt
- All ISPs used must be licensed to operate in the Arab Republic of Egypt
- Internet connectivity must support compliance with Egyptian Cybersecurity law No. 175 of 2018
- Traffic logs must be retained as required by applicable Egyptian laws and MUST data retention policies
- Non-compliance with this policy may result in disciplinary action for staff or contract termination for vendors
15. POLICY REVIEW & VERSION CONTROL
This policy shall be reviewed annually by the IT Director and updated to reflect changes in technology, regulatory requirements, campus expansion, or lessons learned from incidents. All amendments require written approval from the University President.
| Version | Date | Author | Changes |
| 1.0 | 2025 | IT Department | Initial release |