Beruflich Dokumente
Kultur Dokumente
Helder Mendes
Júri
Presidente: Prof. Nuno Cavaco Gomes Horta
Orientador: Prof. Bertinho Manuel Andrade da Costa
Vogal: Prof. Agostinho Rui Alves da Fonseca
2012/06/12
Acknowledgements
First of all I would like to thank my family, my wife Fernanda and my sons Afonso
and Mafalda, for providing me the environment for this work to be produced and bear
with my absent time. To my parents for always encouraged my studies in all stages of
my life.
A great word of gratitude to the coordinator of this work, Prof. Bertinho Andrade
Costa from Instituto Superior Técnico (IST) and Instituto de Engenharia de Sistemas
e Computadores - Investigação e Desenvolvimento (INESC - ID), for his continuous
guidance, support in my doubts and for the long hours spent discussing all the details.
A special word of gratitude to my colleague from TAP Portugal, Engo Manuel Tereno
who was always willing to contribute with his huge knowledge on dierent areas. For
the support and advice.
To my colleagues in TAP from the Flight Data Analysis area, that contribute ev-
eryday for the improvement of the analysis produced.
Abstract
This work presents several techniques to classify Autoland ights starting from the
Standard algorithms provided by the aircraft manufacturers and former software imple-
mentations, evolving then to newer concepts of unidimensional and multidimensional
statistical approaches. Several methods have been tried to distinguish between Normal
from Abnormal automatic landings based on ight data analysis. Together with this
analysis, the automatic procedure to detect and classify new automatic landings was
implemented with the use of Support Vector Machines.
Keywords: Autoland, Flight Data Analysis, Kernel Machines, Support Vector Ma-
chines, Principal Component Analysis, Single Value Decomposition.
Resumo
Neste trabalho são apresentadas várias técnicas de classicação de voos Autoland
(Aterragens Automáticas) com inicio na utilização de algoritmos standard fornecidos
pelos fabricantes de avião e utilizados em programas de análise no passado, evoluindo
para novos conceitos estatísticos de análise unidimensional e multidimensional. Vários
métodos foram testados para fazer a distinção entre aterragens automáticas Normais
e Não Normais com base na análise dos dados de voo. A par desta análise, foi criado
um procedimento automático para detectar e classicar novas aterragens automáticas
com uso de Support Vector Machines.
Palavras-Chave: Aterragens Autoamáticas, Análise de dados de voo, Kernel Ma-
chines,Support Vector Machines, Análise das Componentes Principais, Decomposição
em Valores Singulares.
Contents
1 Introduction 1
1.1 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Aircraft Recording Systems . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2.1 Flight Data Recorders . . . . . . . . . . . . . . . . . . . . . . . 2
1.2.2 Acquisition Units and Non-Crash Recorders . . . . . . . . . . . 5
1.2.3 Flight Data Recording Process . . . . . . . . . . . . . . . . . . . 9
1.3 Flight Data Analysis Software - AGS (Analysis Ground Station) . . . . 10
1.4 Previous Work and Thesis Scope . . . . . . . . . . . . . . . . . . . . . 11
1.5 Thesis Organization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
2 Autoland Analysis 14
2.1 The Autoland Monitoring . . . . . . . . . . . . . . . . . . . . . . . . . 15
2.2 Standard Procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
2.2.1 Poor Integrity (PI) . . . . . . . . . . . . . . . . . . . . . . . . . 16
2.2.2 Poor Performance (PPF) . . . . . . . . . . . . . . . . . . . . . . 18
2.3 Key Point Data Extraction . . . . . . . . . . . . . . . . . . . . . . . . . 19
2.3.1 General Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
2.3.2 Air Sector Data . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
2.3.3 Ground Sector Data . . . . . . . . . . . . . . . . . . . . . . . . 21
2.3.4 AGS Procedure Output . . . . . . . . . . . . . . . . . . . . . . 21
2.4 Autoland and Autoapproach Decision Logics . . . . . . . . . . . . . . . 22
2.4.1 Logics for Flight Type Determination . . . . . . . . . . . . . . . 23
2.5 Autoland Evaluation with Standard Procedure (PPF and PI) . . . . . . 26
2.6 Histogram Evaluation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
2.6.1 Evaluation File Format . . . . . . . . . . . . . . . . . . . . . . . 28
2.7 Outlier Identication and Removal . . . . . . . . . . . . . . . . . . . . 30
2.8 Criteria to Identify Abnormal Cases . . . . . . . . . . . . . . . . . . . . 33
2.9 Multidimensional Evaluation . . . . . . . . . . . . . . . . . . . . . . . . 34
2.9.1 Principal Components Analysis (PCA) . . . . . . . . . . . . . . 34
2.9.2 Biplot Graphics . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
2.10 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
4 Presentation of Results 54
4.1 Applications of the methods to novelty data . . . . . . . . . . . . . . . 55
4.2 Practical Application - Autoland Flights Year 2011 . . . . . . . . . . . 55
4.2.1 Input Space ±3σ Evaluation . . . . . . . . . . . . . . . . . . . . 55
4.2.2 Principal Components ±3σ Evaluation . . . . . . . . . . . . . . 56
4.2.3 SVM Evaluation . . . . . . . . . . . . . . . . . . . . . . . . . . 56
4.2.4 Conclusions on the Obtained Results . . . . . . . . . . . . . . . 56
4.3 Practical Application - Autoland Flights Year 2009 . . . . . . . . . . . 58
4.3.1 Input Space ±3σ Evaluation . . . . . . . . . . . . . . . . . . . . 58
4.3.2 Principal Components ±3σ Evaluation . . . . . . . . . . . . . . 58
4.3.3 SVM Evaluation . . . . . . . . . . . . . . . . . . . . . . . . . . 59
4.3.4 Conclusions on the Obtained Results . . . . . . . . . . . . . . . 59
B Additional Procedures 69
B.1 Wind Component Calculation . . . . . . . . . . . . . . . . . . . . . . . 69
B.1.1 AGS code for Wind Components Calculation . . . . . . . . . . . 70
B.2 Touch Down Point Determination . . . . . . . . . . . . . . . . . . . . . 71
B.2.1 AGS code for Touch Down determination . . . . . . . . . . . . . 75
B.3 Baro-Inertial Vertical Speed and Radio Altitude Rate . . . . . . . . . . 76
B.3.1 Gaussian Smoothing . . . . . . . . . . . . . . . . . . . . . . . . 81
B.3.2 Matlab code for Gaussian Smoothing . . . . . . . . . . . . . . . 84
B.4 Flaps/Slats Conguration . . . . . . . . . . . . . . . . . . . . . . . . . 85
B.5 Thrust Lever Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
B.6 Threshold Crossing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
B.6.1 AGS code for Threshold Crossing Computation . . . . . . . . . 88
B.7 Localiser and Glide-Slope Information . . . . . . . . . . . . . . . . . . . 89
B.7.1 G/S and LOC Track . . . . . . . . . . . . . . . . . . . . . . . . 90
B.7.2 Localizer and Glide-Slope Units . . . . . . . . . . . . . . . . . . 90
B.8 Calculations Performed After Touch-Down . . . . . . . . . . . . . . . . 90
B.8.1 Time calculations . . . . . . . . . . . . . . . . . . . . . . . . . . 91
B.8.2 Reverse Type determination . . . . . . . . . . . . . . . . . . . . 91
Bibliography 103
List of Figures
2.1 DH and RVR values for Autoland (European Aviation Safety Agency (EASA) Legislation) 15
2.2 Autoland Gating Checks for A319 Fleet . . . . . . . . . . . . . . . . . . 18
2.3 Autoland Gating Checks for A320 Fleet . . . . . . . . . . . . . . . . . . 18
2.4 Autoland Gating Checks for A321 Fleet . . . . . . . . . . . . . . . . . . 18
2.5 Autoland Gating Checks for A330 and A340 Fleets . . . . . . . . . . . 19
2.6 Autoland Common Gating Checks for Airbus A320 family . . . . . . . 19
2.7 Autoland Keypoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
2.8 AGS File format - Keypoint data extraction . . . . . . . . . . . . . . . 21
2.9 Autoland and Autoapproach Truth Table . . . . . . . . . . . . . . . . . 24
2.10 Mnemomics used in Table 2.9 . . . . . . . . . . . . . . . . . . . . . . . 24
2.11 PPF conditions distributions . . . . . . . . . . . . . . . . . . . . . . . . 27
2.12 Airbus A320fam Published Pitch Limits . . . . . . . . . . . . . . . . . 27
2.13 Parameters Extracted by keypoint . . . . . . . . . . . . . . . . . . . . 28
2.14 keypoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
2.15 Ground Calculations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
2.16 Evaluation File Format . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
2.17 Biplot Graphic for the 354 Autoland detected . . . . . . . . . . . . . . 38
xi
DAR Digital Aircraft Integrated Data System Recorder
DES Descend
DFDR Digital Flight Data Recorder
DFDRS Digital Flight Data Recorder System
DH Decision Height
DMC Display Management Computer
DMU Data Management Unit
EASA European Aviation Safety Agency
ECAM Electronic Centralised Aircraft Monitoring
ECM Engine Condition Monitoring
ECU Engine Control Unit
EFIS Electronic Flight Instrument System
EHM Engine Health Monitoring
EIVMU Engine Interface Monitoring Unit
EMI Electro Magnetic Interference
EUROCAE European Organization for Civil Aviation Electronics
EWD Engine Warning Display
FAA Federal Aviation Administration
FAC Flight Augmentation Computer
FADEC Full Authority Digital Engine Controller
FCDC Flight Control Data Concentrator
FCOM Flight Crew Operations Manual
FDAU Flight Data Acquisition Unit
FDIMU Flight Data Interface and Management Unit
FDIU Flight Data Interface Unit
FDM Flight Data Monitoring
FDR Flight Data Recorder
FF Engine Fuel Flow
FGC Flight Guidance Computer
FLX Auto-Thrust FLEX Mode
FMA Flight Mode Annunciator
FMGC Flight Management and Guidance Comp
FO First Ocer
FOQA Flight Operations Quality Assurance
FPPU Feedback Position Pick-o Unit
FWC Flight Warning Computer
GA Go Around
xii
G/S Glide Slope
GND CTL PB Ground Control Push-Button
GPL General Public License
GPRS General Packet Radio Service
GRAF Ground Replay and Analysis Facility
GSM Groupe Spécial Mobile
ILS Instrument Landing System
IM Inner Marker
IRS Inertial Reference System
IVV Inertial Vertical Velocity
JAA Joint Aviation Authority
KTS Knot - Nautical Miles/Hour
LA Linear Accelerometer
LAND TRK LAND TRACK Autopilot Mode
LG Landing Gear
LH Left Hand
LO Lift O
LOC Localiser
LGCIU Landing Gear Control Interface Unit
LVP Low Visibility Procedure
LVTO Low Visibility Take-O
MCDU Multipurpose Control and Display Unit
MCT Maximum Continuous Thrust
MEL Minimum Equipment List
MLG Main Landing Gear
MM Middle Marker
MO Magneto-Optical
MOQA Maintenance Operations Quality Assurance
N1 Engine Low Rotor Speed Rotation
N2 Engine High Rotor Speed Rotation
NACA National Advisory Commitee for Aeronautics
NASA National Aeronautics and Space Administration
ND Navigation Display
NLG Nose Landing Gear
OM Outer Marker
PB Push Button
PCA Principal Component Analysis
xiii
PFD Primary Flight Display
PI Poor Integrity
PPF Poor Performance
QAR Quick Access Recorder
RA Radio Altitude
RAs Radio Altimeters
RH Right Hand
RF Radio Frequency
RVR Runway Visual Range
RZ Return to Zero
SD System Display
SDI Source Destination Identier
SSCVR Solid State Cockpit Voice Recorder
SDAC System Data Acquisition Concentrator
SFCC Slat Flap Control Computer
SFIM Société de Fabrication d'Instruments de Mésure
SIM Subscriber Identity Module
SSFDR Solid State Flight Data Recorder
SSM Sign Status Matrix
SVM Support Vector Machine
SVMs Support Vector Machines
SVD Singular Value Decomposition
TAP Transportes Aéreos Portugueses
TD Touch Down
TDZ Touch Down Zone
TOGA Take-O / Go-Around
TR Thrust Reverse
TRK Track Angle
VMC Visual Meteorological Conditions
WPS Words Per Second
WQAR Wireless Quick Access Recorder
xiv
List of Symbols
xv
xvi
Chapter 1
Introduction
1.1 Motivation
1
the workload of the operator.
This work represents also, an opportunity of cooperation between the industry and
the university, that was triggered by TAP's Portugal (TAP) interest to continuous
improve the monitoring of the Autoland Flight Data Analysis due to legal, performance
and safety reasons. It establishes some guiding directions for new approaches for the
analyses of all the landings in the future and in other Flight Phases as well.
Therefore, being TAP committed to the highest safety standards and practices in
the airline industry, it identied the opportunity to perform the assessment of its Au-
toland 1 ights. The aim is to improve TAP 's capabilities by studying new approaches
to the assessment of Autoland Flights. This corresponds also to the European legal
requirement, through EASA regulations, to perform the continuous monitoring for all
Autoland Flights (EU OPS 1.440)[1].
All the landings studied in this project are Autolands, i.e., automatically guided.
Even though there is no human interaction on the aircraft guidance, due to airline's
condentiality reasons, there is no identication of any individual ight in this project.
2
Figure 1.1: SFIM - Photographic Recorder Figure 1.2: SFIM - Recorded Parameters
• Airspeed
• Heading
• Altitude
These devices were used during the 60's and early 70's.
TAP was one of the rst airlines in the world to have a ight data analysis program
established starting with photographic recorders. In 1962 TAP started to introduce in
its eet the jet aircraft, with the Sud Aviation Caravelle VI-R, and kept track on this
transition with the photographic SFIM recorder (Figure 1.1).[5]
With the introduction of the magnetic tape, and the expansion of electronic indus-
tries, the recorders could improve the reliability and a larger number of parameters
could be stored. In Table 1.1, the number of parameters recorded on eets using these
new capabilities is presented.
Table 1.1: Parameters for dierent Fleets
The Digital Flight Data Recorder (DFDR) (Figure 1.3), commonly known as Black
Box or Crash Recorder as it is related normally in the news with serious accident
situations. In fact the goal of this recorder is to be used in such situations so that the
investigation authorities determine what caused the incident or accident.
Authorities required that the last 25 hour of ight data shall be recorded. The
assembly presented in Figure 1.4 and Figure 1.5 is a tape device capable to perform
the recording of the data continuously for the period required by the authorities. In
this way inside the DFDR there is always the last 25 ight hours.
3
Figure 1.3: Digital Flight Data Recorder - DFDR
Figure 1.4: DFDR - Crash Protected Tape Figure 1.5: DFDR - 25 H endless tape
DFDR recorders are crash-protected, the tape is protected inside a case (Figures
1.4 and 1.5), i.e., they are manufactured to survive harsh conditions. Tests of re
resistance, pressure stress and uid immersion3 are performed. The crash recorders
must comply with the specications from European Organization for Civil Aviation
Electronics (EUROCAE) document ED-112 [6]. It specify the conditions that a recorder
must cope without aecting the recorded data and its readability. These are resumed
in Table 1.2.
Table 1.2: FDR Crash Survivable Test conditions
Test Conditions
Impact Shock 3400G, 6.5 milliseconds
Penetration Resistance 500 lb. weight from 10 feet
Static Crush 5000 lbs., 5 minutes
High Temperature Fire 1100 ◦ C, 30 minutes
Low Temperature Fire 260 ◦ C, 10 hours
Deep Sea Pressure 20,000 feet, 30 days
Fluids Immersion Min 48H per ED-112 1
In the 90's a new development on the Flight Recorder was produced with the incor-
poration of solid state memories for the recording instead of the magnetic tape. The
same crash protection standards as in Tab 1.2 apply to this electronic equipment and it
was renamed to Solid State Flight Data Recorder (SSFDR) after its technology. In this
recorder, the solid state memories are embeded into several protection layers (Fig 1.6).
This part of the Solid State Flight Data Recorder (SSFDR) is named Crash Surviv-
3 Fluids that are normally encountered are: - Jet Fuel , - Engine Oil, - Hydraulic Fluid
4
able Memory Unit (CSMU). This new manufacturing, avoided the mechanical moving
parts of the tape recorders. The wear parts are replaced by memories increasing the
reliability of the recorder and producing a much easier maintenance. Two models of
SSFDRs are presented on Fig 1.7
In most aircraft and also on Airbus eets both SSFDR and Solid State Cockpit
Voice Recorder (SSCVR) are installed on tail cone on the back of the aircraft (Figure
1.8). To simplify the notation all the types of crash protected ight data recorders will
be designated as Flight Data Recorder (FDR) in the remaining of this document.
Figure 1.7: Two models of SSFDR Figure 1.8: SSCVR(bottom) and SSFDR(top)
Aircraft Installation
5
Figure 1.9: FDIMU Mandatory
6
Figure 1.10: Tape Recorder Figure 1.11: Tape for Flight Data Recording
used for computer backups in late 70's and 80's and the MO disk the same technology
as Compact Disk Rewritable (CDRW).
There is a major drawback on the use of this type of media, which is the logistics
associated with its handling. There is a large number of media that has to be collected
on aircraft and transported with strong human intervention. Again the technology
used nowadays in cellular communications could provide a solution to this problem [9]
(Fig 1.14). In fact, the state-of-the-art for non-crash recorders is the Wireless Quick
Access Recorder (WQAR) makes use of commercial cellular phone networks for data
transmission.
In Fig 1.15 a typical network layout is presented. The WQAR is equipped with a
maximum of eight cellular radios, each one with its own Subscriber Identity Module
(SIM) card installed. For TAP conguration an intermediate solution of four cellular
radios is in place.
Transmission takes place only on ground when the local cellular phone operator
with which TAP has the contract for mobile communications is available and two
extra conditions are met. The rst is one signal from the Landing Gear indicating the
ground condition, and the second is one signal indicating that one of the passenger
doors is open.
7
Figure 1.14: Wireless QAR
The fact that the transmission occurs normally with the local cellular operator
prevents the costs to become high. If an incident occurs abroad, there is the option to
perform a download-on-demand. In TAP facilities (Figure 1.15), in order to activate
a remote download, the Ground Station establishes a cell phone call with one of the
numbers from the SIM cards installed on the WQAR and sends one SMS message. If
the recorder is properly congured and there is GSM/GPRS network available where
the aircraft stands, the data will be transmitted from the remote cellular provider
(roaming option). In this way there is no need to attend the physical media to perform
an incident analysis.
The same rules for passenger's electronic equipment apply to the cellular radios on
the WQAR and thus these phones are disabled in ight, therefore there is no electro-
magnetic interference with the remaining equipment on the avionics bay.
The second part of the FDIMU was formerly known as DMU on older generations
of Airbus aircraft. This part deals with Non-Mandatory data and this system is named
as Aircraft Integrated Data System (AIDS) on A320 eets and Aircraft Condition and
Monitoring System (ACMS) on A330 / A340 (Fig 1.16).
For the Non-Mandatory system the FDIMU acts in the same way as for the Manda-
tory (Fig 1.9) in the sense that is collects data from a set of Arinc 429 buses from
dierent systems in the aircraft, and packages the data into Arinc 573/717 words to be
8
Figure 1.16: FDIMU Non-Mandatory
stored on a recorder named DAR. In this conguration the FDIMU can be customised,
i.e., the airline chooses the parameters that needs to be recorded in order to match
its dierent analyses. In this way, the use of the DAR has the advantage of extending
the amount of data available on the Mandatory part, provided the acquisition unit is
programmed to this eect.
Besides the customization of the acquisition unit, the Non-Mandatory system also
produces Reports that can be generated during ight, and accessed at this time or later
on ground.
The recorder itself (physical equipment) is the same as the QAR presented in Figures
1.10, 1.12 and 1.14. It is the connection position inside the avionics bay, that denes
whether this equipment is installed as a QAR or a DAR. Depending on the position
installed the contents of the recordings are dierent.
The acquisition block of Fig 1.17 normally receives information on Arinc 429 stan-
9
dard containing the information that can be formatted in Arinc 573/717 words and
recorded. The computers or avionic units that perform this task can have dierent
designations depending on its function (Mandatory/Non-Mandatory) or manufacturer,
the most known in the industry are:
The designation Flight Data Acquisition Unit (FDAU) is normally the name for
acquisition units in Boeing eet. It is one equipment that deals with both the Manda-
tory and Non-Mandatory parts, having internally electronic boards dedicated to each
of these functions. The FDIMU, FDIU and DMU are designations from Airbus air-
craft, and its functioning was addressed in Sections 1.2.1 and 1.2.2. In short FDIU
and DMU are dedicated separately to Mandatory and Non-Mandatory data respec-
tively. These are avionic data acquisition computers from older generation aircraft.
The newest merge the two computers into a single one named FDIMU.
The full description of the transmission protocol Arinc 429 and the ight data record-
ing protocol Arinc 573/717 can be found in references [7] and [8].
10
Figure 1.18: AGS Conception (From Analysis Ground Station V12 Users Manual [10])
11
mal values from ights that are not detected with the Standard Procedure. The rst
approach to this problem is described in Section 2.6 corresponding to the Histogram
Evaluation of the data. It is a statistical study to determine which parameter values
are deemed to be considered as atypical or outliers.
In a perspective of implementing new methodologies and future developments, spe-
cially creating one automatic classication, new concepts for data mining will be applied
to the training set derived from the Histogram Evaluation.
During the search phase for new solutions it was found that the newest developments
with Flight Data classication are performed with Support Vector Machines (SVM).
The work from NASA in reference [15] is one of the most developed work in this area
(up to the author's knowledge). Other references ([16], [17], [18], [19]) explore the
SVMs capabilities to nd atypical cases within a set of new vectors from which there
is no previous knowledge from its normal or abnormal state. This approach is to be
fully described in Chapter 3.
The objectives to be achieved with this thesis are resumed in Fig 1.19.
12
sentation of technology and protocols existing in the industry and a brief history
of Flight Recording.
• Chapter 2 presents the Autoland and its classications, starting with the criteria
provided by the aircraft manufacturer. It focus the creation of a training set by
the labelling of atypical data with the criteria studied. The multivariate analysis
of the data is introduced with the Principal Component Analysis (PCA).
• Chapter 3 concerns Kernel Methods and Support Vector Machines. In this chapter
it is studied the use of learning machines to detect abnormal or atypical patterns
on the Autoland Flight Data. Problem formulations conducting to optimization
problems with restrictions conduct to minimum and maximum function calcula-
tions using Lagrangean Multipliers.
• Chapter 4 presents the results and the comparison of the methods presented in the
previous Chapters. The testing of novelty data is performed with these methods.
• Chapter 5 contains the presentation of the conclusions obtained from the results
in Chapter 4. Guiding to future developments and nal considerations.
• Appendix A there is a full description of the ight data used in this project, for
A320 family of aircraft.
• Appendix B describes key procedures in AGS that had to be studied and pro-
grammed in order to have feasible data for the Autoland. These are general
procedures and considerations that are used on any analysis.
• Appendix C - Vol II is the complete listing of the Autoland procedure in AGS
developed during this project.
• Appendix D - Vol II. Description of C support programs and script les that are
necessary to the implementation of this project.
• Appendix E - Vol II. Full details of LIBSVM package with application to model
construction with Autoland data.
13
Chapter 2
Autoland Analysis
The Autoland is the automatic control of the aircraft during approach and landing.
Normally it is done in low visibility conditions and pilots see the runway lights just
a few seconds before the landing occurs[20]. This operation is considered a highly
safe manoeuvre due to the automatic control systems present in avionics equipments
dedicated to the Autoight systems. But for those landings to happen, there are strict
requirements imposed by the authorities on aircraft equipment, on aireld equipment
and special crew qualication that pilots must attain.
In this work, the dierent studies for Autoland assessment, aim to have a robust
characterization of real abnormal cases. This concept is established rst, by the limits
provided by the aircraft manufacturer, and then some new approaches to the problem
with dierent algorithms are studied.
The Autoland is part of a broader concept described as the All Weather Operation
(AWO), which allows one airline to operate in extreme low visibility conditions. This
has strong economical impact in the airline operation, allowing the schedules to be
kept during the whole year and avoiding alternate airports, extra transportation and
hotel expenses.
The Approach and Landing ight phases considered in this work make use of the
Autoight system. During the descend and in order to follow the path reference pre-
cisely, two autopilots are needed together with the Instrument Landing System (ILS)
(Section B.7). In some cases it is enough that a certain automatic guidance is done, till
the point where the necessary visibility of the runway is achieved. In this situation the
pilot can disconnect the autopilots and proceed manually with the rest of the approach
and landing. There is not a full use of the autopilots till landing, and this is considered
one Autoapproach.
There are three categories for an Autoland, known as CAT I, CAT II and CAT
III which are determined by the ceiling of the clouds and the visibility conditions of
the day. The ceiling and visibility are indicated by two quantities which are, Decision
Height (DH) and Runway Visual Range (RVR).
According to EASA legislation, the denition of DH is:
Decision Height is the wheel height above the runway elevation by which a Go-Around
must be initiated unless adequate visual reference has been established and the aircraft
position and approach path have been assessed as satisfactory to continue the approach
14
and land safely.
The RVR is the visibility distance in the horizontal on the runway.
The respective values of these quantities for each category are presented in Table
2.1.
Table 2.1: DH and RVR values for Autoland (EASA Legislation)
Category DH RVR
CAT I 200 ft 550 m
CAT II 100 ft 300 m
CAT IIIA 50 ≤ DH < 100 RVR ≥ 200 m
CAT IIIB 0 ≤ DH < 50 RVR ≥ 75 m
CAT IIIC No DH No RVR
It is important to note that the RVR is the visibility on the runway. The visibility
during the descend when autoland is being performed is normally zero, i.e., the landing
is performed inside clouds, which makes this kind of operation extremely dependent
on the technology both on aircraft and on ground. This is also the main reason for
autoland ights to be closely monitored with the ight data.
In order to explore the data and develop tools that allows the operator to moni-
tor the Autolands performance and detect any system misbehaviour all autolands and
Autoapproaches will be subjected to dierent algorithms conducting either to classi-
cation during the AGS ight processing, or later, through the extraction of a File with
Flight Data for analysis with other tools.
The Flight Data used to produce this le, and on dierent analysis on this project,
is completely described in Appendix A. Once this le is available with ight data,
it can be evaluated using the Standard Procedures classication methods, followed by
other concepts involving Statitical and Data-Mining techniques aimed to improve the
detection of abnormal Autolands. It must be emphasized that this task is performed
manually and this work aims to automatize it.
In Fig. 2.1 there is a summary of this process. The data starts to be recorded
on-board, which is known as Raw Data. AGS processing produces a classication and
15
exports the ight data le to be used on atypical or novelty detection algorithms. All
the steps are concern of this project, specially assuring that each one feeds the necessary
information for the next step.
The FMA displays to the pilot, the autoight and autothrust modes. From left to
right of Fig 2.3, each column represents respectively:
• Autothrust Modes
16
• Autoight Vertical Modes
• Autoight Lateral Modes
• Approach Capabilities
• Autoight Status
This listing has a decreasing order per amount of systems available. Having its max-
imum capability the aircraft can perform a CAT III Dual, which means no limitation is
present and both autopilots are engaged. If just one autopilot was engaged the indica-
tion CAT III Single was present but there was no limitation on approach and landing
capability, as the autopilot in charge can cope with the functions by itself. As there
are more system degradation on the aircraft, the indications become CAT II in a rst
step and nally CAT I. Each of these degradation steps produces a triple click aural
warning issued in the cockpit and this advise has the correspondence on the activation
of a bit on the information provided by the Autoight system. These bits are part of
the signals that will classify the Autoland as Poor Integrity (Fig 2.4).
The L3FO_INOP means that CAT III Dual is inoperative and the meanings are
similar to L3FP_INOP and L2_INOP for CAT III Single and CAT II respectively.
These bits are fully described in Table A.3.
Other than the system degradation, Poor Integrity identies whether there was any
warning during the landing. This information is provided by the remaining signals
present in Fig 2.4. AUTOLAND_ON is a signal that indicates when there is a fault
condition during the autoland, this includes the disengagement of one autopilot. The
17
rest of the signals are Master Warning (MW) and Master Caution (MC) warnings,
and correspond to red warnings (the most limitative faults) and amber warnings (less
limitative faults) being displayed in the front panel of each pilot (known as glareshield
panel).
During the descend, the autoland capability warnings, master warning and master
caution are checked for any activation. Together with the autopilot disengagement
logic, if any signal is triggered, it produces a classication of PI on the autoland. This
corresponds to the OR logic presented in Fig 2.4.
The Tables 2.2, 2.3, 2.4 and 2.5 are the checks that each eet shall meet, in order not
to be classied as Poor Performance Autoland. The fact that they have been provided
by the manufacturer on dierent dates the criteria is not a uniform one. It varies, even
within the same aircraft family (A319, A320 and A321). On an experimental basis and
in the context of this project, a uniform table shall be derived out of the tables above.
Any other conditions provided by the manufacturer can be programmed in the same
way as these that are being used for the sake of uniformity.
18
Table 2.5: Autoland Gating Checks for A330 and A340 Fleets
Decription Minimum Value Maximum Value
Rate of Descend @ 100ft (RALT) -19 ft/sec -6.5 ft/sec
Rate of Descend @ 50ft (RALT) + 3 sec -15 ft/sec -4.5 ft/sec
Rate of Descend @ TD -7.5 ft/sec 0 ft/sec
ROLL angle @ TD -4 deg 4 deg
PITCH angle @ TD 1 deg 10 deg
Time from RALT=50ft to TD 4 sec 11 sec
GS Deviation @ 100ft (RALT) -87.5 mDDM 87.5 mDDM
LOC Deviation @ TD -9.5 mDDM 9.5 mDDM
Cross Wind 0 kts 15 kts
Tail Wind 0 kts 10 kts
Head Wind 0 kts 30 kts
Each condition is identied with a label PPFn, with n representing one of the
Poor Performance conditions. Any value out of the minimum and maximum limits
from Table 2.6 produces one PPF condition producing an overall classication of Poor
Performance.
19
Table 2.7: Autoland Keypoints
KEYPOINTS
Criteria Value Description
100ft 100ft Radio Altitude
50ft 50ft Radio Altitude
Flight Path Related
30ft 30ft Radio Altitude
TD Touch Down Point
LNDTK Land Track Mode (≈ 400ft RA)
FLARE Flare Mode initiation
Autoland Related
ALIGN Align Mode initiation
ROLOUT Rollout Mode initiation
Other Criteria TH Threshold Crossing (Section B.6)
• General Data
• Air Sector Data
• Ground Sector Data
20
Air Sector and Ground Sector are considered as General data. The parameters that
may be relevant on both sectors are also considered in this section. As an example, jet
engine related parameters are N1 (fan rotating speed), N2 (core rotating speed), Fuel
Flow and Vibrations. The full description of the parameters is on Tables A.1 and A.2.
The le description (Table 2.8) indicates that it is composed by three data sets. The
rst is the identication of the ight, which is not used in this project to keep every
21
ight unidentied, the second is the snapshot data from each keypoint (Figures 2.5
and 2.6), which is repeated in each keypoint. The third is composed by General Data.
With such data structure, each landing is described by a set of 270 values.
Figure 2.7 decribes how the AGS Procedure separates the Autoland from the Au-
toapproach and the implementation of the Standard Procedure to each of the deter-
22
minations. The procedure scans the landings on a 8Hz rate, and a rst decision is
whether there was a Full Autoplilot Use during all the descend. This logic will be fully
described in Section 2.4.1.
After this rst assessment the concepts of Poor-Performance (PPF) and Poor In-
tegrity (PI) introduced in Sections 2.2.2 and 2.2.1 are applied. A deviation from the
limits established by the aircraft manufacturer, indicates a degradation of the perfor-
mance of the Autoland, and these are subjected to individual analysis.
In the case of Autoapproaches presented in Fig 2.7, there is not the classication of
Poor Integrity, due to the fact that by denition of Autoapproaches, the autopilot is
disengaged by the pilot. In this way at least one signal is always present enabling the
OR in Fig 2.4. Therefore, all the Autoapproaches would be classied as PI, and no
valued added information would come out of this classication.
These two items are highly improbable, and only spurious signals can produce this
indication. Due to this reason these situations will be considered as invalid.
The logic for the determination of the conditions above are presented on Table 2.9.
The mnemonics used in Table 2.9, are presented in the next Table 2.10.
Each truth functions on Table 2.9 can be simplied with the help of the Karnaugh
maps [21] yelding:
The Autoapproach logics (AAPR ) indicate that there was a disconnection of the
autopilots after LAND TRK (below 400ft AGL). This could be a voluntary action from
the pilot, to proceed with the rest of the landing manually or an instability condition
that caused the autopilots to disconnect. In the second situation, either the landing
is continued manually, or a Go-Around is performed. This means that the landing is
23
Table 2.9: Autoland and Autoapproach Truth Table
Mnemonic Description
AP1@LT Autopilot #1 Engaged at LAND TRK
AP2@LT Autopilot #2 Engaged at LAND TRK
AP1@TD Autopilot #1 Engaged at Touch-Down
AP2@TD Autopilot #2 Engaged at Touch-Down
ATL Autoland
AAPR Auto Approach
INVALID
discontinued in a perfectly safe way, and a second approach will be performed for the
landing to be nished. There is a ag in the output le (Table 2.8) indicating that a
go-around occurred (FLAG_GA).
If none of the logics ATL, AAPR or INVALID is triggered according to the logics
above, we are in the presence of a Manual Landing, i.e., the autopilots are not connected
on the test points, as a consequence the autoland modes did not appear, and the landing
was conducted manually. This is the majority of the ights in the operation. Despite
these ights are beyond the scope of this project, the AGS procedure (Appendix C)
24
Figure 2.9: Karnaugh Map for Autoapproach Decoding
also exports one le with the contents of the ight path keypoints presented in Fig 2.6,
which are the ones not depending on the autoland modes. This le is kept for future
developments for manual landing analysis.
The invalid condition present in Table 2.9 also leads to the creation of a le if the
conditions are met. There should never appear one entry in this le as it corresponds
to the autopilot engagement below 400 ft AGL or the swap of the autopilot engaged.
As there are recording errors, one of this condition may appear due to this fact and
the le created pinpoints the ights where this happens.
In short, the AGS autoland procedure described in Appendix C provides four Comma
Separated Value (CSV) les named:
• AUTOLAND_[eet].CSV
• AUTO_APPROACH_[eet].CSV
• REGULAR_LAND_[eet].CSV
• INVALID_[eet].CSV
25
following the logic described in this Section and the format from Table 2.8.
According to the conditions described, 354 ights have been classied as Autoland
and 513 as Autoapproach. Finally for the rst case there are 23 PPF and 17 PI. For
the second case a total of 174 PPF have been detected.
At this stage just the output le of AGS program is being used, i.e., no pre-processing
or outlier removal was performed. The conditions can be met either for a real case or
outlier data appearing due to noise.
There is no a-priori justication to have a certain amount of Autoland output out of
the processed ights, as it depends on the ceiling and visibility conditions on the des-
tination airports. Winter conditions are more prone to less visibility and to automatic
landings to be performed. Note that, as stated before, the majority of the ights are
manual landings.
The distribution of PPFs for the Successful Autoland detected is in Table 2.11
The most restrictive condition on Table 2.11 is PPF6. Checking where these results
appear (Fig 2.12), we can see that all the values beyond the gating condition are quite
close to the limit (6.5o ), which leads to the conclusion that this limit may have a
conservative value.
Having in mind that the origin of this limit is from tables 2.2, 2.3, 2.4 and 2.5 which
determine the limits imposed by the aircraft manufacturer, it can be seen that there is
still a margin to increase the limit of Pitch angle at Touch-Down. This is also justied
by the geometry limits for Airbus A320fam presented in Table 2.12, which limits Pitch
values for a Tail-Strike to happen [22]. Out of this table the most limiting value for
pitch is for Airbus A321 (the longest), with a value of 9.7o . Performing a balance
26
Table 2.11: PPF conditions distributions
between these two sources of information, a value of 7.5o will be set for the pitch limit
at touch-down in future analyses.
Table 2.12: Airbus A320fam Published Pitch Limits
27
With these criteria the number of cases to be checked by one analyst dropped from
354 to 23 + 17 = 40 (Fig 2.11) on the rst evaluation processing.
There is though a big incidence on Touch-Down values on the Poor Performance
criteria as presented in Table 2.6. Despite these values are of major importance for
the landing performance assessment, this analysis can be extended by the use of data
collected on the other keypoints during the approach and landing (presented before in
Figs 2.5 and 2.6).
As there are no reference values from the manufacturer on the remaining keypoints,
the determination of the distribution for a certain number of parameters shall be pro-
duced in the form of histograms. The presentation of the data in this way allows the
analyst to determine what are the values far apart from the average, and consider these
cases as Abnormal. These can be either outliers or real abnormal cases.
The rst diculty on having such an approach has to do with the number of param-
eters that dene each landing presented in Table 2.8. This is a set of 270 parameters
containing numeric data, strings and binary values. In order to evaluate the some pos-
sible methods to analyse this data, a sub-set of numeric parameters which are relevant
for the landing are extracted from the le generated in the AGS Procedure (Appendix
C). The le that is created is described in the next section.
28
Table 2.14: keypoints
Keypoints
Landing Track
100ft of Radio Altitude
Threshold
30ft of Radio Altitude
Flare
Touch Down
Mnemonic Description
DELTAT_LT Time between Land Track and Touch Down
DELTAT_TH Time between Threshold and Touch Down
DELTAT_LG De-rotation Time - Time between MLG Touch Down and Nose LG Touch Down
LAND_DIST Airborne Distance - Distance between Threshold and Touch Down
POOR_INTEG Autoland Performance according to criteria described in Section 2.2.2
POOR_PERF Autoland Integrity according to criteria described in Section 2.2.1
29
2.7 Outlier Identication and Removal
The histograms presented in this section highlight some cases where there are values
clearly beyond the limits. This study was applied to all the dimensions of the le
presented in Table 2.16. Figures 2.13 to 2.18 show the eect of removing the rogue
values from the initial 354 set of Autoland ights. The clean dataset is composed of
276 ights.
20
300
15
250
Frequency
Frequency
200
10
150
100
5
50
0
0
0 200 400 600 −16 −14 −12 −10
[ft/sec] [ft/sec]
Figure 2.13: Rate of Descend with Outliers Figure 2.14: Rate of Descend Clean [ft/sec]
[ft/sec]
30
100
25
80
Frequency
Frequency
20
60
15
40
10
20
5
0
−5 0 5 10 15 20 −4 −2 0 2 4
[mDDM] [mDDM]
Figure 2.15: Localizer @ Land Track with Figure 2.16: Localizer @ Land Track Clean
Outliers [mDDM] [mDDM]
The Outliers values may be originated by bad recording data corrupted by noise.
The examination of the outlier cases allows the identication of several causes that
origin extreme values to appear in the data set. One case identied was with one value
of Rate of Descend at Flare, that indicates −49f t/sec. This would indicate a value
of −2940f t/min in the vertical speed indicator presented in the cockpit. Making use
of the results obtained for this set of Autoland ights we can see that the average
radio altitude for Flare in 30f t, and the average time from Flare to Touch-Down is
4.67 seconds. if the value for Rate of Descend presented was true this time would be
30
Vertical Acceleration @ Flare Vertical Acceleration @ Flare
35
150
30
25
100
Frequency
Frequency
20
15
50
10
5
0
0
0.0 0.2 0.4 0.6 0.8 1.0 0.90 0.95 1.00 1.05 1.10
[g] [g]
Figure 2.17: Vertical Acceleration @ Flare Figure 2.18: Vertical Acceleration @ Flare
with Outliers [g] Clean [g]
less than 1 second. The data from this specic ight in the Flare area is presented in
Figure 2.19. In this gure, it can be seen that there are 4 seconds of recorded data
missing, exactly in the area where the are should appear. It can be seen in the last
column that data is missing from a radio altitude of 74f t till 18.5f t, which leads to
the Abnormal value presented.
Another way that Outliers can be generated is through errors produced during
calculations. An example of this situation can be seen in Figure 2.13. This histogram
is deformed due to the presence of four false values of rate of descend in the sample of
354 autolands. The rogue values are 444f t/sec, 458f t/sec, 692f t/sec and 740f t/sec,
clearly displaced of the average. The parameters from one of this ights is presented
in Figure 2.20, indicating where the threshold was detected (parameter TRESH ). It
is a premature detection when the barometric altitude is around 800f t and the radio
altitude indicates some instability which induced the error in the calculation of the
31
threshold crossing (Appendix B.6).
A third option for the outlier to appear is due to a real displacement from the average
values. This is the case of the localizer deviation presented in Figure ?? and detailed
in Figure 2.21. In this gure there are two regions where some interference appears
in the localizer signal. The rst around 3000f t of barometric altitude and the second
around 1500f t. It important to keep in mind that this dierence has a visualization in
the cockpit instrumentation of 0.4 dot approximately (77.5mDDM = 1dot in Localizer
- Section B.7.2). This value, despite being beyond the normal values, was not enough
either for the autopilot to disconnect automatically or manually by the pilots, and the
Autoland was performed with success.
The presentation of the data in the form of histograms, allow each of the parameters
to be compared within its equals of the whole set. This criteria looks at each parameter
individually and in case it is out of the criteria the whole ight will be agged as an
abnormal case. This is a valid procedure based on the composition of Univariate
32
Analysis.
The examples in this section show that there is an improvement on the detection of
abnormal cases with the new approach of Histogram Evaluation on each keypoiny. Due
to the fact that there is no condition for the Land Track or Threshold on the Standard
Procedure, the situations above have not been identied either as PPF or PI.
Once the Abnormal cases are identied within the set of 354 Autoland ights, they
are removed in order to produce a new set, in which these values do not aect the
average (µ) and standard deviation calculations (σ ). This new, clean set is intended to
be used in the evaluation of new cases, i.e, the parameters values of Autoland ights
are directly compared with µ ± nσ, n ∈ N to evaluate whether it is a value that can be
considered as Normal, having as a base the initial set of 354 ights.
These criteria are used to identify the ights that divert from the average (µ) of the
remaining ights or are marked with PPF or PI. In practice it is created one extra
column in the output le from the AGS Procedure, named Label, that will contain the
value yi = 1 for the Normal cases and yi = −1 otherwise. When at least one condition
happens, the label will become yi = −1 performing in this way the logic OR of the
criteria indicated above.
The limits of ±2σ superimposed in the histograms, can be used as a reference that
establish the frontier to separate the Normal from Abnormal cases. The goal for this
identication is to decrease the workload of the analyst and let him/her to be focussed
only in the most extreme cases.
Up to this point the methods applied consisted in the analysis of a single param-
eter at a time, which is one Unidimensional analysis, i.e., each parameter is tested
individually, being the nal labelling a combination of these results (in this case a
OR ).
There is though the concern that the correlations between the dierent parameters
could add extra information to the results or put in evidence other features of the
data. This leads to the Multivariate approach to this problem which is described in
the following section.
33
2.9 Multidimensional Evaluation
The studies performed in the previous sections (2.5 and 2.6) are based on the composi-
tion of single parameter analyses. In this section, analytical tools from the Multidimen-
sional or Multivariate Analysis [23] [24] [25] will be used in order to ease the workload
to detect Abnormal patterns in the Autoland data. Under this context, each parame-
ter is now referred as a dimension of the space Rm which in this case corresponds to
m = 42. The most evident limitation when working with a higher dimensional space
is the loosing of graphic capability to describe the data in a straightforward way.
In the previous section, the solution was to plot all the dimensions in a histogram
per each dimension. It is though easy to see that this is a time consuming task, and
more dicult as the dimension gets higher. In this section a study to overcome these
diculties is presented. The method to be applied to the Autoland Data Matrix with
dimension 354×42 is the Principal Component Analysis (PCA). Each row of this matrix
is now considered as an observation of a m dimensional random vector. The PCA is
applied to Autoland data and nally a graphic evaluation of the multidimensional data
is introduced, the Biplot graphical presentation.
1 1 X−1
fθ (θ) = P exp(− (θ − µ)T
(θ − µ)) (2.1)
2π m/2 | |1/2 2
P
where θ is a m dimensional column vector, µ represents the mean and is the
Covariance matrix of Xn×m , dened as:
σ1 2 σ12 . . . σ1m
X σ21 σ2 2 . . . σ2m
=
... .. .. .. (2.2)
(m×m) . . .
σm1 σm2 . . . σm 2
34
Pn
− x)(yi − y)
i=1 (xi
σxy = (2.3)
n−1
Let the data be organized in a matrix Xn×m such that each column represents a
dimension (data component) and each row represents a sample point. The rst step
consists in obtaining one estimate of the mean (µ̂) for each dimension,
1X
n
µˆj = xij (2.4)
n i=1
X
d 1
= (x − µ̂)T (x − µ̂) (2.5)
m−1
where (x − µ̂) represents the subtraction of µj from each xj column (centering data).
Dening Z = X − µ̂ and using the SVD factorization, then Z = UΛVT , where U
and V are orthogonal matrices with n × n and m × m sizes respectively and,
λ1 0 ... 0
0 λ2 ... 0
.. .. .. ..
. . . .
Λ, 0 0 . . . λm (2.6)
0 0 ... 0
. .. .. ..
.. . . .
0 0 ... 0
P
It follows from Equation 2.1 that (θ − µ)T −1 (θ − µ)) can be written as
(θ − µ)T V(ΛT Λ)−1 VT (θ − µ)), where V contains the eigenvectors to transform the
data, ΛT Λ contains the eigenvalues that are used to select the principal components
and
W , VT (θ − µ) (2.7)
represents the transformation that is used to detect patterns in the space generated
by the Principal Components. Note that ΛT Λ is a diagonal matrix which can be
interpreted as the covariance matrix of the uncorrelated statistic variables {wj } with
λ2i representing its variance.
This method can be used to classify the data for new Autoland ights in the following
way:
35
1. Select a set of Autoland ights that represent the Normal performance.
P
2. Apply the PCA to the set dened in the previous point by computing µ̂, c, V
and {λ2i }.
3. given a new Autoland, θ, use Equation 2.7 to compute w and compare wj with
λ2j , j = 1, 2, ..., m to classify the performance of the Autoland.
The application the the Autoland data matrix is produced with the script presented
in Appendix ??. This analysis provides 42 Principal Components, which is the space
dimension, that follow the decreasing order of Variance shown in Figure 2.22 1 .
2.0
Standard Deviation
1.5
1.0
0.5
0 10 20 30 40
Primary Components
The Analysis of this Figure, shows that the higher the Principal Component is, the
less it contributes to the total Variance of the set. The eect of Principal Component
Variance contribution, can be seen using the Cumulated Variance which is dened for
the rst k principal directions as:
Pk 2
i=1 (σ )
CU M _V ARk = Pi (2.8)
trace( )
The graphical representation of Equation 2.8 is presented in Figure 2.23 and calcu-
lated in Appendix ??.
Being one accumulated value, each Principal Component adds its Variance to the
total Variance of the set, till the value of ”1” is attained if all the 42 principal directions
are taken into account (Figure 2.23).
This graphic provides the information about which Principal Components from the
set are relevant to describe the full set without loosing much information. In this
case, for the Autoland data, it is represented in Figure 2.23 an horizontal line that
corresponds to 93% of the total Variance of the set. This occurs for the 24th Principal
Component, which means that the dimension of the space can be reduced from 42 to
24, keeping 93% of the total variance of the set. The remaining 18 dimensions just
cope with 7% of the total variance.
1 The Variance follows the same line as the Standard Deviation because V AR(X) = σx2
36
1.0
0.8
Cumulative Proportion of Variance
0.6
0.4
0.2
0 10 20 30 40
Primary Components
X̂(n×m) = Y(n×2) ZT
(2×m) (2.9)
Where Y is known as Row Marker Matrix and Z the Column Marker Matrix. Keep-
ing just two dimensions is one approximation that corresponds to use the rst two
principal components or eigenvectors to describe the data. This loss is compensated
by gaining the capability to represent the data graphically. The Biplot graphic is con-
structed with the plotting of the rows of matrix Y that represents the points of the
data and the columns of matrix Z that represents the projections of the remaining
principal components on the plane dened by the rst two principal components (see
Figure 2.24).
The conversion of the data into the form of Equation 2.9 can be achieved by factor-
izing the data matrix X with the Singular Value Decomposition (SVD), already used
in the previous section,
X = UΛVT (2.10)
Taking the rst two columns of U, yelding K = ΛVT and considering its rst two
rows, the Equations 2.9 and 2.10 can be compared, resulting:
37
Y(n×2) = Û(n×2) (2.11)
ZT
(2×m) = K̂(2×m) (2.12)
The rst Biplot graphic produced is in Figure 2.24. This image is one display of
the Autoland data matrix with dimension (354 × 42) without any dimension reduction
provided by the Principal Component analysis study (Appendix ??).
5000
0.1
33 210
18
189
246
192 244 219 307 99
352103
340
148 70245
129
206
287
266
350 187
132
207
123
349
114 195
83 213
305 95
332
301 299
180229
60120
348 110
351
118 212
25 152 173
260
339 96
344
250
274
73 338185
262
44
117
347
104
345
196
59
327
158
36108 42
124
94
168
333 306
113
271
202
52 7255 3
252
292
208
220343
277241 141
303
51
88200
216
251
261
296
199
289 119
145257
176
7775147
342
151
139
214
184
279
133
26562 64 136 4393 56
Comp.2
69 291
346218
227 191781281
263 82 126
267125174
87 2114197
231
67 30
211 171 53
0.0
143
47284258336
302 2 330
181
71
280
223221
232
354
2492352
297 36 247194
131
328 74166 326
GS1_100
121
ROD_TD
PITCH_TD
PITCH_FL
PITCH_TH
PITCH_LT
PITCH_30
FPA_LT
FPA_100
PITCH_100
FPA_TD
FPA_FL
FPA_TH
FPA_30
ROD_LT 63
224 35
32 16929
127
198 81 34300
0
283243
156 157
298
107
331 DELTAT_LG
VRTG_FL
LOC1_FL
LOC1_LT
ROD_100
LOC1_30
POOR_INTEG
POOR_PERF
VRTG_TD
VRTG_30
LOC1_100
VRTG_LT
VRTG_100
VRTG_TH
LOC1_TD
LOC1_TH
ROD_30
ROD_FL
DELTAT_LT
GS1_LT
204 230
DELTAT_TH
GS1_TH
48 80
GS1_FL
GS1_30 102
310
270
138
27323272 57
233
GS1_TD 203
16 89
79 91242
205
140
45100
128
315
253 222
309
115 39286
85 97
225
159 98 116
167
ROD_TH105
290
101
234
144
334
130 318324
314 186 135 23910
329
90
109
154 209
275
22 190 269142
61
153 201
28183 111
304 316 112 293 12
50 49 38
40 106 21554 276 325 321 353268
6 335
58 41 320
264 175 162 319163 84
285 248 68 161 226
179
13
313
164 27
288188 19 312
26 LAND_DIST
322 149
65 228
323
240 11
282 341
295
17 155 237 150 160238 178
8 66 20 46 217 172
137255 86182
294
24 134122 5 256 177
311
308 278
7
−0.1
4
337 317
76
259
−5000
92
170
−0.2
15
9
37
Comp.1
The analysis of this graphic clearly highlights the relative position of some vectors
from the set of the whole 354, that are out of the cloud comprising the cases that can
be considered as Normal. these vectors are resumed in Table 2.17.
Table 2.17: Biplot Graphic for the 354 Autoland detected
If the criteria is tightened, some more cases can be chosen on the border of the
cloud, but those referred are clearly identied from Figure 2.24.
38
Dimension Reductions
The Principal Component analysis from Section 2.9.1, provides the results that shall
now be applied. The conclusion from this study was that 24 dimensions could represent
93% of the Variance of the whole set. On the other hand, Figure 2.24 shows which cases
are considered out of the Normal situation without any reduction on the dimension
of the set. Building the Biplot graphic for the rst 24 principal components results in
Figure 2.25 (code in Appendix ??). It can be veried that the same vectors presented in
Table 2.17 fall outside the main cloud, which conrms the assumption that the original
space dimension can be reduced to 24 dimensions.
15 37
9
3000
0.2
92
2000
170
76
0.1
337 259
24 160
1000
317 294
4
155 308
134122 86 11 311 278
177
866
313 1644627 137255 5 161
182
13 162 26
288 19 172
17 20 507 188217 54 149 68 178 323 256 295
49 38
237 190
140
106 150
175 84215
163
Comp.20
23865
322
314112 6 240
282 41 341
226
264 319
142 135
40 9091 315
275 115
Comp.6 80
285
167
105
101 138276Comp.1
312 321 353 239
Comp.2
154
109 209128
61
269
309
39
28 230
Comp.8
111 334 329
10268320
89 22 242
45
205
354
100153
253
107 302166
159
Comp.17
304116
Comp.19 53234293
144
58
57
127
233
6412 228 Comp.14
81 179
0.0
0
243
280 157
47 218Comp.5
247
Comp.16 201
281 77
121
Comp.24
290 63
224 29
16 221
69292
249 235 298
250
346227
Comp.10 328
17126374
289 2102
326
1262167
33035 Comp.18
32
181
272 186 56
300
Comp.21
Comp.12
283 252284 131
Comp.9
78
Comp.3
51
88 262
344258 1267
261
Comp.2 75
174
125
176 14
310 71
214
26562
143 241
Comp.7 19982 119
Comp.4
145
36
345 139
151 52
342
212
257
168 306 Comp.23
93
208 220 73
200
194
191 347
216
185
301
351 333
10894 229
202 Comp.22
136
7255 173 30
260
339 141
277 274 44
251
303 117 124
113
327
147
42
158 152
60
197
133
279184 433
96 343 266
118
196
350
332 195
83 271 120
207
206 180
25
348 338 296352 187
349299 305
−1000
123
340
1147059 95
244110 287
104
148
307245129213
192 103132
189
219246 99
−0.1
33
18
210
−2000
31
−0.2
−3000
146
193
165 254
Comp.1
There is though the concern on which components are the most representative of
the set. Shall it be the ones that have a high Variance or the lowest ones. This is the
same as knowing which represents more information for the set, the variables that have
a high Variance or the ones that vary on minimum quantities. To have one assessment
of the answers to this problem, the same study was conducted for the lowest Variance
components, and the result is presented in Figure 2.26.
Both studies on Figures 2.25 and 2.26 have been conducted with the support of
Figures 2.24 and 2.23. In the last case of low Variance, there was a successive incre-
ment of the dimensions being considered, till the same vectors of Table 2.17 become
evident. This was achieved with the lowest 36 Principal components. In conclusion,
to reach the same results one needs more 12 components when using the lowest prin-
cipal components instead of the highest, which means that there is no benet on this
approach.
39
Lowest Variance Components
−2000 −1000 0 1000 2000
31
193
254
165 146
0.2
2000
0.1 210 189 33
18 246
307 244
99245 340352
266
206
219
207 301 192
229
305 195
129187
70
83103
349148
350
332
123
173 152
9555
120 60212
Comp.25
306 132
180 287
114 351 73 250
347
213
72 29994117
168
333
327
202 25
124
11342 118
36185 274 344260
56 214 52
271158
231 345261 Comp.7
262 348 339
933Comp.22
Comp.23 59108
257
Comp.26
342 77
119
196 289
110199
21644 22096252292
Comp.2
64184265
279
62
133147176
139
53 75
21
151 145
126
87
104 251
3381
Comp.9
7851Comp.10
141 88 291 Comp.12
69
211
Comp.21 43
136 127330 67
14 125
1742230
267
281
296
82
166 302
Comp.33
263
303336
343 218
277
227 346208354223Comp.11 Comp.13
232
0.0
30 29169
198
57 197
181 Comp.36
Comp.37
Comp.24
138 326
Comp.4080 200171
Comp.16
Comp.29 241
24728447
236 280
221
34 81Comp.18 35
32
71314
233 Comp.31
Comp.30
224
63 74 191 128297
235
143
0
10 135 324
Comp.15 23
234
270 121
Comp.34
102 204
48
Comp.17 328
131
194258 157
107
315
331242243
140 249
156
239
329 272 273
Comp.42
Comp.35
Comp.32
310
144
116 98
167
Comp.41
105
Comp.38225 222
15939
97 309 298
115 100
253 1902059189283
300
Comp.14 293
161 334
318 Comp.19
112 Comp.8
101
Comp.39 85 203
154
10990
41 353
11
186 Comp.27
12 6 68
321
130
27631654 111
215
30484319
290
142
Comp.6
286 269
28 162 209
106
275 45
61175 22
153 38
79 1650
49 40
320268 282 58 149Comp.20 201 288
18326Comp.28
163 26427 155 13
313
179
341
295 226
323335325 312 19160
285
188 164
240 65 322
178
294 248
238 150 17
668
228 172137 237 46 2420
182
5 255 122
86 134 217
177 256
278 311 308
−0.1
4 317 337 7
76
−2000
259
92
−0.2
170
15
9
−4000
37
−0.3
Comp.1
2.10 Summary
In this Chapter an Autoland analysis was created producing one output le that is to
be used on the remaining of the project for normality evaluation. This le is composed
with 354 autolands and 42 relevant dimensions during landing have been extracted.
On this le it was applied some methodologies to detect Abnormal cases. The
methods tested are:
• Standard Procedure
• Histogram Evaluation
• Principal Component Analysis (PCA)
In a perspective to create one complete solution, i.e., one that other than implement-
ing the Autoland analysis, implements one automatic classication of new analyses
produced, a complement of tools and methods are used to achieve this goal. Up to this
stage, AGS produces the Autoland analysis le that is further investigated with data
mining tools outside AGS. In this case tools like Matlab, Octave or the R statistical
Environment allow a deeper investigation on the AGS le results. The methods above
identied Abnormal cases each one according to its criteria. These cases are identied
in the generated le with a eld containing a Label.
The automatic assignment of such labels is still one part of the work that is to
be studied in the remaining Chapters. One of the state-of-the-art areas that deals
with the classication of Multivariate Data is the area of Kernel Methods and Support
40
Vector Machines. In the References there are several works that indicate the trend
to use this kind of techniques with Flight Data Analysis, specially the work from
NASA in references [15] and [28]. Also the work presented in reference [17] presents
one comparative study of dierent methodologies to analyse Flight Data in order to
detect Abnormal behaviours. This study indicates that the Support Vector Machines
algorithms have a suitable performance to be used for binary classication, i.e., the
distinction between two states such as Normal and Abnormal.
There is a benet for the airline, and for the analysis that all of these situations are
identied in a in a precise way, so that the analysis eort can be reduced by focussing
directly the cases that may have additional information, or may be problematic. Out-
liers are in this context identied as cases clearly distinct from the remaining present
in the data set.
The next Chapter gives a presentation about the theory behind the concepts of
Kernel Methods and Support Vector Machines.
41
Chapter 3
x1,1 x1,2 . . . x1,m , y1
x2,1 x2,2 . . . x2,m , y2
(xi , yi ) ≡
... .. .. .. .. (3.1)
. . . .
xn,1 xn,2 . . . xn,m , yn
42
inner product shall be dened on this space with the notation w·x ≡ wT x. Having the
inner product dened, the metrics of distance and angle between vectors arise naturally
from this denition. A space with these characteristics is known as an Euclidean space.
The calculations that will be presented in the following sections make use of the dot
product and space metrics of distance.
In the Matrix 3.1, each column of the Input Space is known as a feature. In the case
of the project here presented, the features are the extracted parameters according to
the algorithm presented in section 2.3.4.
The set Y , for a multiclass classier, is a k dimensional space representing k classes
for grouping data. In the case of binary classication, which is the focus of this project,
we have Y ≡ {±1} and only one of these two labels are assigned to each vector from
Matrix 3.1.
To apply the Supervised Learning concept to the ight data in practice, there are
three sets of data that will be used during this project. These are:
The goal is to estimate the relationship between inputs xi and the corresponding
labels yi . The eort for having a suitable SVM is to have estimator function f that
correctly predicts the label yi for a given input vector xi (Equation 3.2).
f : X ∈ Rm −→ Y ∈ Rk (3.2)
The typical use of a Support Vector Machine (SVM) involves two steps. First to
train the data to obtain a model, and second using this model to predict results on a
testing data set [32].
43
Figure 3.1: SVM Principle - Binary Classication Hard Margins
wT xi + b = 0 (3.3)
The setting of the scale is performed by assigning wT x + b = 1 for the positive side
and wT x + b = −1 in the negative case. These two hyperplanes dene the maximum
margin possible, and are known as Canonical Hyperplanes. The region that is located
within these hyperplanes is dened as the Margin Band (Figure 3.1).
On one side of the hyperplane we have wT xi + b > 0 (stars), and on the other
side wT xi + b < 0 (squares), which leads to the decision function (Equation 3.4) to
determine in which side will be any new input vector x.
Considering the value of yi on each side of the hyperplane, the relation 3.5 is always
valid, as this is a product of either two negative numbers or two positives.
Picking any two vectors that lie on each of the Canonical Hyperplanes such as x1
and x2 (Fig 3.1), the Equations wT x1 + b = 1 and wT x2 + b = −1 are valid, by
denition of the hyperplanes.
44
Subtracting both equations yields the result presented in 3.6.
wT (x1 − x2 ) = 2 (3.6)
Dividing Equation 3.6 by kwk we obtain the projection of vector (x1 − x2 ) over the
unitary normal vector w/kwk (Equation 3.7). This corresponds to the margin ρ, that
has to be maximized.
w 2
(x1 − x2 )T = (3.7)
kwk kwk
2
ρ= (3.8)
kwk
From the result presented in Equation 3.8, it is easy to conclude that the maximiza-
tion of the margin ρ, is equivalent to the minimization of kwk, which is rewritten in
Equation 3.9, known as Objective Function.
1 1 T
min kwk ⇔ min w w
2
(3.9)
2 2
Keeping in mind that for the present case with no noise all the points satisfy the
condition |wT xi + b| ≥ 1, when multiplied by the labels, a new constraint to the
Objective Function appears in the form of Equation 3.10.
Equations 3.9 and 3.10 represent a quadratic optimization problem with constraints,
which is solved using Lagrange Multipliers. The Lagrangian Function is thus repre-
sented by Equation 3.11. This is a convex function in which the rst member is the
Objective Function, the second corresponds to the constraints and αi ≥ 0 are the
Lagrange Multipliers.
1 X n
L(w, b, α) = (wT w) − αi (yi (wT xi + b) − 1) (3.11)
2 i=1
45
∂L X n X n
=0⇔− αi yi = 0 ⇔ αi y i = 0 (3.12)
∂b i=1 i=1
∂L X n X n
=0⇔w− αi yi xi = 0 ⇔ w = αi yi xi (3.13)
∂w i=1 i=1
By replacing Equations 3.13 and 3.12 into 3.11, the primal variables w and b are
eliminated (see 3.14) [34].
1 X n
L(w, b, α) = (wT w) − αi (yi (wT xi + b) − 1) =
2
! n ! " i=1
! ! ! #
1 X
n X X n Xn
= αi yi xi αj yj xj − αi y i αj yj xj xi + b − 1 =
2 i=1 j=1 i=1 j=1
1 X
n X
n n X X
n
= αi αj yi yj (xT
i xj ) − α i αj y i y j (xT
i xj ) − b αi yi + αi
2 i,j=1 i,j=1 i=1 i=1
(3.14)
The result of this calculation is known as the Dual Formulation of the problem
W(α) represented in Equation 3.15.
X
n
1X
n
W(α) = αi − αi αj yi yj (xT
i xj ) (3.15)
i=1
2 i,j=1
The Dual Formulation (Equation 3.15) has the advantage of normally being easier
to solve than the Primal Formulation (Equation 3.11), as the optimization is produced
using the dot product of the input data (xi ) and the corresponding labels (yi ). For this
reason the construction of this problem is oriented to obtain the solution of the Dual
Formulation.
According to the theory optimization with constraints, the optimum solution for
both Equations 3.11 and 3.15 is the same and will represented as (w∗ , b∗ , α∗ ). This
point is known as the Saddle Point of function L(w, b, α), which is obtained by the min-
imization of the function regarding the primal variables (w, b), and the maximization
in order to the dual variable (α). It satises the following two conditions (Equations
3.16 and 3.17):
46
L(w∗ , b∗ , α∗ ) > L(w∗ , b∗ , α) ⇔ W(α∗ ) > W(α) ⇔ maxW(α) ∀α (3.17)
α
The importance of the Saddle Point is that it is a sucient condition for the existence
of a solution [35]. If (w∗ , b∗ , α∗ ) is a saddle point for function L(w, b, α) , then w∗ is a
solution for the Objective Function. Moreover if this function is convex, the solution is
unique.
The set of necessary and sucient conditions for the existence of an optimal solution
is, according to the optimization theory with constraints, known as Karush-Kuhn-
Tucker (KKT) conditions. At the optimum point, the conditions from 3.18 till 3.22
are satised. Assuming the KKT conditions are met, we can say that the solution
(w∗ , b∗ , α∗ ) exists and is the same for both primal and dual formulation.
αi∗ ≥ 0 (3.18)
X
n
αi∗ yi = 0 (3.19)
i=1
X
n
∗
w = αi∗ yi xi (3.20)
i=1
47
Figure 3.2: αi for sample vectors in input space
out for the determination of w∗ (Equation 3.20), and later, as it will be shown, in b∗ .
The result is the best separating Hyperplane, w∗ xi + b∗ = 0.
As for the bias b∗ , it will be found separately on each side of the hyperplane [29].For
the vectors labelled as yi = +1, the equality min(wT xi + b) = 1 is valid, as long as for
those with yi = −1 the valid equation is max(wT xi + b) = −1. Using the result from
Equation 3.13, the following relations 3.23 and 3.24 are obtained.
!
X
n
b = 1 − min αj yj (xi T xj ) , yi = +1 (3.23)
j=1
!
X
n
b = −1 − max αj yj (xi T xj ) , yi = −1 (3.24)
j=1
Summing both Equations 3.23 and 3.24, b∗ is obtained from the optimal solution α∗
(Equation 3.25).
" ! !#
1 Xn Xn
b∗ = − max α∗ yj (xi T xj ) + min αj∗ yj (xi T xj ) (3.25)
2 yi =−1 j=1 j yi =+1
j=1
Arriving at this point, the decision function from Equation 3.4 will will be dened as
in Equation 3.26 for the optimal solution. Any new vector x will be classied according
to the best hyperplane separating the two classes, represented by Equation 3.26.
48
!
X
n
f(x) = sign(w∗T x + b∗ ) ⇔ f(x) = sign αi∗ yi (xi T x) + b∗ (3.26)
i=1
It is important to note that the predictions performed with Equation 3.26 depend
only on the inner product between x and the Support Vectors as αi = 0 for all vectors
from the Input Space which are Non-Support Vectors. In this way, many of the terms
used in the sum will be zero.
The presence of noise in the data is taken into account by the introduction of non-
negative slack variables ξi into the objective function 3.9 and the constraint Equation
3.10. This new approach leads to Equations 3.27 and 3.28.
yi (wT xi + b) ≥ 1 − ξi , ξi ≥ 0 , ∀i (3.27)
" #
1 X n
min wT w + C ξi ,C > 0 (3.28)
2 i=1
49
The minimization of kwk corresponds to the maximization of the Pmargin. The new
objective function (Equation 3.28) incorporates the new term C ξi , which being
added to kwk will produce a change on the size of the margin. Considering thatPthe
whole Equation 3.28 is to be minimized, as the value of C increases, makes that ξi
has to be the smallest possible, in this way resulting in a solution with the minimum
amount of errors due to noise [30]. The value of C regulates the weight of the errors
and for this reason is known as Regularization Parameter.
Applying again the Lagrangian primal formulation to the objective function 3.28
subject to the constraints of presented in 3.27, the result is:
1 X n X n X n
L(w, b, α, ξ) = (wT w) + C ξi − αi (yi (wT xi + b) − 1 + ξi ) − ri ξi (3.29)
2 i=1 i=1 i=1
∂L
= 0 ⇔ C − αi − ri = 0 ⇔ αi = C − ri (3.30)
∂ξ
As ri ≥ 0 and C > 0 the Equation 3.30 yields always αi ≤ C . This results presents
an additional limitation to αi . In this way, αi is limited by 0 ≤ αi ≤ C which is known
as the box constraint.
Continuing to apply the same procedure as in section 3.2, the Dual Formulation
of Equation 3.29 is obtained by the replacement in this equation of all variables w,
b and ξ . The resulting formulation is in Equation 3.31, and the new corresponding
constraints presented in Equations 3.32 and 3.33.
X
n
1X
n
1 X 2
n
W(α) = αi − αi αj yi yj (xi xj ) − α (3.31)
i=1
2 i,j=1 4C i=1 i
0 ≤ αi ≤ C (3.32)
X
n
αi y i = 0 (3.33)
i=1
50
As in the previous section (3.2), this formulation will lead to the best separating
hyperplane, w∗T x + b∗ = 0 , where w∗ and b∗ are calculated after the value of αi∗ is
determined solving Equation 3.31, with constraints 3.32 and 3.33.
The value of w∗ is still provided by Equation 3.20, and the decision function 3.26 is
the same to be used with the current data. Also for this formulation, the Non-Support
Vectors are those for which αi∗ = 0. Considering the limitation for αi∗ presented in
Equation 3.32, the Support Vectors are the remaining for which 0 < αi ≤ C .
In order to make use of the results presented in the previous sections (3.2 and 3.3),
where the linear separation was possible, the structure of a inner product has to be
51
dened in the Feature Space H. This will allow the direct replacement of the dot
product in the Input Space according to Equation 3.35, since the Dual formulation can
be written entirely in terms of the dot product. The fact that the same data in another
space can be linearly separable, makes easier the calculus involved and it allows the use
of the results from linear algebra in a set of data that originally was not in a suitable
conguration for the results to be used directly.
i xj −→ Φ(xi ) Φ(xj )
xT T
(3.35)
It is the dot product Φ(xi )T Φ(xj ) that is called Kernel Function, K(xi , xj ) (Equation
3.36).
All the results from the previous sections can be used, provided that the dot product
of input vectors xi xj is replaced by the corresponding dot product of the vectors in the
Feature Space, Φ(xi )Φ(xj . This is the same as replacing by the Kernel, K(xi , xj ) and
this is the core of the Kernel Trick.
Provided there is one Kernel Function that has the properties of an inner product,
it is enough to dene the Kernel and it is not necessary to know the mapping Φ(x),
as this function is implicitly dened by the Kernel. This is an extra simplication of
the algorithm, based on the fact that the inner product structure is maintained in the
Feature Space.
In this way, after the Kernel substitution, the dual formulation derived in section
3.3 (Equation 3.31) will be represented as the following Equation (3.37).
X
n
1X
n
1 X 2
n
W(α) = αi − αi αj yi yj K(xi , xj ) − α (3.37)
i=1
2 i,j=1 4C i=1 i
The solution of Equation 3.37 will yield the value for α∗ , and from this the best
separation hyperplane w∗T x+b∗ = 0 using the same results from the previous sections.
The constraints from Equations 3.32 and 3.33 are still maintained, and the decision
function presented in 3.26, becomes
!
X
n
f(z) = sign αi∗ yi K(xi , z) + b∗ (3.38)
i=1
52
• Linear Kernel: K(xi , xj ) = xi T xj
• Polynomial Kernel: K(xi , xj ) = (γxi T xj + r)d , γ > 0
• Radial Basis Function (RBF) Kernel: K(xi , xj ) = exp(−γkxi − xj k2 ), γ > 0
• Sigmoid Kernel: K(xi , xj ) = tanh(γxi T xj + r)
This multitude of choice for the Kernel is another advantage for using the Support
Vector Machines. In this project the tools used have dierent kernels that can be tested
with the Flight Data available, but it is general idea in the references available ( see [36]
for example), that the Radial Basis Function (RBF) Kernel is a good starting point.
53
Chapter 4
Presentation of Results
In this Chapter the methods presented in the previous chapters are going to be imple-
mented and the classication of novelty data shall be produced. The methods are:
The set of ights that are to be used as a standard is the 354 Autoland ights
studied in Chapter 2. Out of this set a new one is produced only with those ights
within a window of ±3σ on all of its dimensions. This is a clean set that produces the
averages (µi ) and standard variations (σi ) for the new vectors to be compared with.
The gure used as a comparison for the rst two methods is represented in Equation
4.1, representing the mean distance for a new vector z. Each component of this vector
is related to the original vector x by zi = xi − µi .
v
u
u1 X m
zi2
dmed = t z ∈ Rm (4.1)
m i=1 σi2
The SVM model is to be implemented with the Software LIBSVM [32]. Its descrip-
tion, use and capabilities are fully described in Appendix E. The typical use of a SVM
algorithm involves two steps: rst, training a data set to obtain a model and second,
using the model to predict or classify the information present on a testing set. These
steps are summarised in Figure 4.1. Step #1 aims to create the model based on already
known labels. This model is then applied to Unlabelled data as in Step #2, producing
the output of Predicted Labels for the testing data. This second step is repeated as
long as there is data to test, whereas the rst step is run once to produce the model.
Several support programs and script les have been created during the course of
testing the data as a result of specic needs. These are presented in Appendix D.
54
Figure 4.1: SVM Model Creation and use
301
56
8
60
96
Mean Distance
Mean Distance
6
40
52
4
83
20
74
72 297
434855 78
2
169 130
124 217 288 321
19 44 64
42 67
70 91102 110123 145 165 170 210 303312 326
331342
31 4047 6371 80 92101111
495459 117125 156 173184 199 254265
246 261 276 316 332
334
330
344 374
370
6 20 36
384553 616873
60 82 97
81
79 105
99 109
116 127
132
134 174
168
167
162 180 200 213224 244
209 260 283
255 295 315 320329 347
345
337
124 91321
172330
2633 5057 66
37 46
77
6975 84
86
90100
98108115
112
114 131
118128
133
149
142
140
143152
153
154
151
166
160171
179
175
185195
194
188
183
186
215
214
227236
228
223 235
251
245
249258
252 267
259
253 275
274
279
278
291
292
287
286
324336
318
322
328339348
346
356 373
361
571016 2532 41 62
65 76 95 113
107 120
8994 106119 135 148
146
144
137 158
155
159
161 176 190
193 208
201
207
205
203
211 225
232
230 242
241
234
238 250 266
262
268 306
304
302313
308
299 314323
325
333 359
360
363
357
353 366
56 1218222835 87
8893 121
129
122 147157164
172181
182 198
192
191202 218
206216 229
221231
233240
247257
263
264
256 273
277
270280
282
281 296307
309
310
311
317 335
341 352
350 364
358
354362
3 811
15
14 24293439 5158
27 85 103 126 136
139
138 177
178 189
187 204
197
196 219
220
222
226
243
237
239248 269
271
272 284
289298
293
290300
294
327
319 340351
343
338
372
355365
349 375
371
368
367
96 104 141150 163 212 285 305 369
52
74
72 83
78 297
123456710
91219
20
17
13
1621
23
1826
2531
3036
38
33
323743
44
42
40
41 48
49
47
45
50
46 55
54
5359
60
61
57
6267
64
63
6570
71
68
6673
69 80
7682
81
79
77
75 84
8691
92
90
89 102
101
97
98
95 110
111
105
99
100109
108117
116
115
107
106 118
112
114
113
120130
124
123
125
127
132
134
131
128
133 145
140
142
135143 156
149
152
153
154
151
148
146
144
137 160
158
155
159
161170
165173
168
166174
167
162171
175184
180
179
176185 199
200
195
194
188
183
186
190
193 208
201
205
203 217
210
213
209
215
214
207
211 224
227
228
223 235
232
225
230 246
244
236245254
255
251
249
242
241
234
238 250 265
261
260
258
252
259
253 267
266
262
268276
275
279
274
278288
283 295
291
292
287
286 303312
306
304
302
299 321
316
318
313
308 326
320
315
322
314 331
330
332
334
329
324
328
323
325333342
337
336344
347
345
348
346
339 356
361
359
360
363
357 374
370
373
366
81115
14 2228
24
27
2935
3439 5158 87
88
85 94
93103
104 119
121129
122
126136
139
138147
141 157
150 164
163172181
182
177
178 189
187 198
192
191202
196 206
204
197 218
216
219
220
212 229
221
222231
226 240
233 247
243
237
239248257
256263
264273
270
269
271280
277
272282
281
284 296
298
293
290
289 307
300
294 309
310
311
305 317
319327335
341
340
343
338 353
352
350
351
355
349 364
358
354362372
365375
371
368
367
0
285 369
0
Figure 4.2: Input Space - 2011 Autoland Figure 4.3: Input Space - 2011 Autoland
Flights Comparison Flights Comparison (zoom)
55
4.2.2 Principal Components ±3σ Evaluation
Changing the base to the principal directions, the results are presented in Figures 4.4
and 4.5. Equation 4.1 is now calculated for the components of the new base.
Principal Component Space (2011 Autolands) Principal Component Space (2011 Autolands)
120
10
301
100
8
80
Mean Distance
Mean Distance
6
52
72
60
169
47
91
92 125 217
83 124
4
43 81
111 152 244
70 82 283
40
2
1321 41 5762 66 179 192 218 296 310 359
30 3945 55 65
26 8895105
107 170 191 208
203212 239 255 294 306 326 333 350
343 362
357 371
368
360
108120 136144 168177
172 205
209 230 231
234 335 355365
336 353 374
20
11
12
71017 27 42 58 85 126 137 178
171 193
194
189 204
198
195 235247
248 260
258268
261 274
269
275 331 373
5 1618
1925 34 51
32
29 76 94 119
129
133141154
138
135
128139
161
158 175182
180 206
202
197 214 229241
223 249
253
264 282
259 279
273 290 305
286 314324 338 364
367
9
8 103112
118 176 190
183 215
216226 243
250 262
263 278
287 300 319328 340 369
372
3 89 104 150 196 211 256
163 187 233 271
272285 327 366
351
4752 72
31 43
4448 60
64
61 70
68
63 83
81
82 91
92 111
90 117125
124131 152
149 165
167 184 217 232
213 244
236 270 283
288
295
297 312321
315 345
123456710 15
14
13 22
23
20 28
24
21
26 33
30 40
37
35
38
36
41
39 49
46
50
45 59
54
53
5562
57
58 67
66
65 74
71
69 79
78
75
73
77
8086
87
84
88 97
93
95101
98
99
100 110
106114
109
102 115
113
107
105
108 123
121
120130
127
122
116 132
126136145
143
140
134 148
142
147157
153
146
151
144 164
160
159
162
156
155166173
170
168 185
186
181
174
179
177
172
178 188199
192
191
193 207
200
201
194 210
208
203
205 219
220
212
209
204 227
224
221
225
228
222
218 242
240
238
237
231 246
239
234
230
235 251
245254
257
252
255 266
265
267
260
268 280
281
277
276 291
292
289
284 302
304
299
298
293
294 316
308
311
307
309
303
296 313
310
306 323
318
320
317
322
325332
330
329
337
333
326 346
339
334 344
348
341
342
347
343
335
336
331 356
352
349 361
358
354
350
355363
359370
362
357
360 375
371
368
365374
81117
912
16
18
19 27
25 34
32
29 42 51 76 85
8994103112
104 119
118 129
128137
133
138
135
141
139 154
150 161
158
163171
175 189
182
180
176 190
183
187 198
195
197
196 206
202 214223
215
216
211 229
226 247
241248
249
243 258
253
250 261
259
262
256 269
264
263274
275
273
271 282
278
272 290
286
279287
285 305
300 314324
319328
327 338
340 353364373
367
369
372
366
0
233 351
0
Figure 4.4: PCA Space - 2011 Autoland Figure 4.5: PCA Space - 2011 Autoland
Flights Comparison Flights Comparison (zoom)
1. The Outliers are immediately identied by any of the methods. these are case56 ,
case96 , case169 and case301 (Figures 4.2 and 4.4).
2. The remaining cases are clustered for lower variances (Figures 4.3 and 4.5). There
are a large number of cases close to the limits. This eect of analysing the data
through its principal directions enhances the visibility of cases that would not be
recognised if only the input space was being checked.
3. Concerning the SVM, the model used was trained with the same set that the
remaining methods are using for performing the comparisons. It provided the
conrmation of a large number of cases detected by the PCA and not detected by
the Input Space ±3σ . There are though a number of cases that are only detected
56
Table 4.1: 2011 Autoland Flights Detection Methods Comparison
yi = −1 Findings
Vectors Label (±3σ) Label (PCA ±6σ) Label (SVM)
√
case9 √
case11 √
case31 √
case43 √
case44 √ √
case47 √ √
case48 √ √ √
case52 √
case53 √ √ √
case56 √
case60 √
case61 √ √
case64 √
case67 √
case68 √ √
case70 √ √
case72 √
case74 √
case76 √
case78 √
case81 √
case82 √ √
case83 √ √
case91 √ √
case92 √ √ √
case96 √
case111 √
case112 √
case117 √
case118 √
case124 √
case125 √
case130 √
case131 √
case149 √
case152 √
case165 √
case168 √ √ √
case169 √
case199 √
case213 √
case217 √
case228 √
case244 √
case283 √ √
case288 √
case295 √
case297 √
case300 √ √ √
case301 √
case312 √
case318 √
case321 √
case326 √
case345 √
case352 √
case374
Total 5 37 34
by the SVM (Table 4.1). These cases have been identied as being tagged with
one of the original labels PPF or PI. Being discrete (binary) signals they are not
checked by the remaining methods. Only two cases are labelled just by SVM not
being in the situation above. These are case74 and case321 which correspond to
points very close to the decision limit.
57
4.3 Practical Application - Autoland Flights Year 2009
The current set to be submitted to the dierent methods is composed of 254 new cases
corresponding to Autoland ights detected by the AGS procedure for the year of 2009.
10
115
8
74
1000
Mean Distance
Mean Distance
6
4
500
23 240
2 67 148 198 211 216 229
8
2
70 97
2529 39 47 59 6672 85 94100 128
133
151 161
147154
158
160 178 206
201 231 241
254 34567 19 374146 55
3034
35 58 76 849091 110
111
109 117125
116 134 150
130 157
156 165
167 177182
189 203
213
209 222230237
214221 242
253 22 33 73 103
105
108 135 144
131 163 179 188 246
1216
11 20 8086
7579 8995101 121
122
124 137 146152
149
145 169176
166 194 204
200 225 238 249
1 913 1518 28
24 31 36 43
3238 525761
606468
62 81
8387 96
93
8892 98 106113
118 132 142
138 153 164
155162 171
173
170
175181
184
180 187 199
202 210
212 220228
219
218
215 235
232 248
244
1014 27
172126 45
44
42
40 48
50
4953 63
6569
71 77
7882 99 107
102 114120
119
129
126
123
127
140
139
143
141
136
172
168174 183
185
186
193
192
196
197
208
207 227234
233 247
243
239
236 245250
251
5154 104112 190195 205 217
191
224
223
56 226
159
2529 39 55 70 97
100 128 151 161 198
0
0
1215
13
14 19
16
1720
1822
21 26
24
2730
28 34
33
31
3237
35
3641
3843
40 46
47
45
44
42 4852
50
4953
515459
58
57
5660
61
62
6366
64
6568
69
7175
72
7376
7780
79
81
78 85
84
82
838690
91
89
87
88 94
95
92
9396
99103
101
98
102 110
109
108
105
106
107
104 116
113
117
111118
114
112 125
120
121
119 133
130
129
126
123
122
124
127 134
135
131
132138
137
140
139
136 146
147
145
143
144
142
141 152
149
153158
154
150
155160
157
156 165
167
163169
166
164
162 170
168 178
176
173
175
171177
172
174 182
179 189
188
184
181
180187
183
185
186 194
193
192
190
191 201
200
199
196
197
195 206
203
204
202
205211
213
209
210
208
207212219
215
218
214 225
220
222
221
217223 231
230
228
227
226
224 237
235
232
233 241
238 246
242
239
234
236243249
248
245
244250
251
247
Figure 4.6: Input Space - 2009 Autoland Figure 4.7: Input Space - 2009 Autoland
Flights Comparison Flights Comparison (zoom)
10
115
5000
74
4000
Mean Distance
Mean Distance
128
3000
151
25
94
241
2
4
29 145150 229
2000
2732 6066 7577 8893 105 117 140147 162 174 176 202 215 223 251
1000
4 13
364045495559 8490 99
7983
78 127 139
123 143149 197 221 237 242
22 43 51 62 64 97102 112 125132 155 168 195 203
208 217 244
254 46 56 63 104 170
172 191
190 200205 218 238
3538 207 247
243
245
6571
92
101107 141 226 236
239
253
159
252
0
0
123456781911
0
13
12
14 19
16
1518
172125
23
2024
22 29
26
2731
30
2833
34
32
3539
37
3641
40
3843
4247
45
44 50
48
49
465155
52
54
53 58
5761
60
59
56 6267
64
63
6568
66 75
73
70
6972
71 76
7781
80
79
78 85
82
8387
86
8488
89
91
9094
93
92 100
96
9598
99
97102108
103
104
101 113
110
109
106
105111
107114120
118
119
116
117
112 123128
129
125
124
121
122
127 134
131
130
126132138
135
136
133
137 145
140
143
139144
142
141 151
150
148
146
147 154
158
155161
156
153
157
152
149 166
167
163
165
160
162
164 173
169
168
170
171 179
178
175 184
181
176
177
174
172 182
183
180 189
188
187
185
186 194201
198
196
192
193199
195
190
191197 206
202
203209
204
205
200 210
213
211
208
207212218
214 229
225
220
216
219
215 223
222
221
217 231
232
228
226
224
227 234
235
233
230 240
241
237
238
236243
239 248
245
242
244250
249
246
251
247
Figure 4.8: PCA Space - 2009 Autoland Figure 4.9: PCA Space - 2009 Autoland
Flights Comparison Flights Comparison (zoom)
58
4.3.3 SVM Evaluation
The SVM evaluation produced a total of 26 cases identied with the label yi = −1.
These are presented in Table 4.2 in parallel with the reaming methods.
yi = −1 Findings
Vectors Label (±3σ) Label (PCA ±6σ) Label (SVM)
√
case2 √
case3 √
case7 √
case8 √
case11 √
case19 √
case20 √ √
case23 √
case25 √
case29 √
case30 √
case31 √
case41 √
case47 √
case55 √
case58 √
case62 √ √
case67 √ √ √
case74 √
case76 √ √
case94 √
case108 √
case113 √ √ √
case115 √
case125 √ √
case128 √
case145 √
case148 √
case150 √ √
case151 √
case154 √
case158 √ √ √
case159 √
case161 √
case165 √
case166 √
case179 √
case184 √
case189 √ √
case198 √
case201 √
case202 √
case206 √ √
case211 √
case212 √
case216 √
case221 √ √
case229 √
case231 √ √
case240 √
case241 √ √ √
case252 √ √ √
case253 √ √ √
case254
Total 6 43 26
59
1. The cases that are clearly identied as being out of boundaries are case74 , case115 ,
case159 , case252 , case253 and case254 .
2. In this set of testing data, again the eect of the analysis through the PCA basis
compared to the original dimensions, highlights a certain number of cases that
would be not identied if the rotation to the principal components of the set have
not been done (Figures 4.7 and 4.9).
3. The SVM classied as yi = −1 whenever there is a PPF or PI label from the
Standard Procedure, or the case is already identied by any of the other methods.
It shows again that it is a suitable method when discrete signals are present in
the testing data.
60
Chapter 5
In this work an Autoland Procedure was developed in the AGS Flight Data Analysis
Software. This procedure was based on extraction keypoints meeting certain condi-
tions during descend. The relevant parameters to be exported have been identied
and a vector of dimension 270 is created for each landing. In the same AGS procedure
the existing specications for Autoland analysis from the Aircraft Manufacturer and
former existing specications have been implemented. During the landing, the meet-
ing of certain conditions lead to a Poor Performance (PPF) or Poor Integrity (PI)
classication.
The AGS procedure produced a high dimensional matrix with outlier values present.
The problem to classify this information in an automatic way, extracting relevant
information and not loosing the legacy classications from previous methods was to be
tackled. As databases become larger it turns impracticable for a human operator to
process it and data-mining tools are each day more necessary.
On the search for possible solutions, three methods have been compared, testing
the data and tuning the methods as new results were found during the work. The
data was analysed on its original dimensions and the limits of ±3σ established to
determine the Abnormal cases. The Principal Components Analysis (PCA) performed
the analysis of the data on its principal directions, corresponding to its eigenvectors.
Here each new dimension is a linear combination of the original dimensions and are
ordered by decreasing order of variance. This new perspective of the data revealed
that the variance of the new components (dimensions) was higher than in the original
dimensions, which allowed to detect cases that became clear due to the fact that in
this new base, the correlation of the original components is part of the base. These
cases would be unnoticed if just the original base was taken into account.
In order to implement an automatic classication based on state-of-the-art method-
ologies, the Support Vector Machines (SVM) have been introduced. This is a Machine
Learning algorithm capable of classifying a large amount of data in a robust way. This
method relies on a Training Set, a set of data with known labels, in order to build a
model of the data. There is a complementarity between this method and the remaining
two, as the precision of the classication provided by the two previous methods is used
by the SVM to create the Training Set and produce a model of the data. The most
accurate the classication of the data is, the more reliable it is the model created. Once
the model is established it can be used indenitely with novelty data.
61
5.1 Conclusions on the obtained results
During this work a model of the data was created with a set of 354 Autoland ights,
and a total of 629 ights have been subsequently classied by the three methods under
study. Some conclusions on the methods used are provided next:
1. The Input Space ±3σ evaluation conducts to the detection of a very limited num-
ber of cases. For the 629 total cases, just 11 have been compliant with this criteria.
These cases proved though to be valid cases to detect corresponding to the Outliers
which are the cases where the most extreme values are present (unreal values).
They have been conrmed by the remaining methods.
2. The Primary Component Analysis enhances the correlation between the original
dimensions. Using a new base produced with a linear combination of the original
dimensions by decreasing order of variance, leads to a greater number of cases
detected. The detection is 80 cases out of 629 with the PCA.
3. The SVM was trained with the same vectors being used on the previous set as
the comparison set. This method proved to be the only one studied in this work,
capable of handling both analogue and binary data on the same set. Most of the
cases that are identied by this method correspond to the binary ags PPF or
PI activated. This makes also this method compliant with the legacy techniques
that are on the origin of this work. There was a total of 60 cases detected by this
method, out of 629, but not matching exactly the cases found by the PCA. It was
veried that the cases that PCA found and has not being followed by SVM are
cases close to the decision boundary of ±3σ . These cases may not be signicant,
but once they are identied by one of the methods it shall be labelled to be further
investigated.
4. It seems reasonable for the application under study, that the use of the complement
of the two methods PCA and SVM is the solution to have the nal labelling of
the cases. There is a complementarity of these methods that allow to detect
the maximum cases in a robust way. Making use of both methods there is a
total of 111 cases detected as Abnormal to be further investigated in a total of
629 Autolands. These are Outliers and minor drifts from the Normal average of
certain parameters. This automatic classication allowed that 518 cases have been
immediately recognized as Normal and no concern is given to these cases. The
remaining 111 are those that are labelled to be further investigated, but compared
with the original 629, this represent a workload drop of 82, 4% for the analyst.
62
1. Increase the number of features (dimensions) on each vector. The features
used for obtaining the model have been based on the from Chapter 2. The vec-
tor dimension being used, was high enough to draw conclusions on multivariate
techniques without loosing the capability for the individual evaluation of each di-
mension (Histogram analysis). Using the learnings from this project a new model
with a higher number of features can be produced in such a way that most of the
full 270 dimensions from the output le of the AGS procedure are monitored.
2. Improve the standard sets. The new cases, being classied as Normal, can be
added to the Standard database to robust the base set for the rst two methods.
Any classication can be used to improve the Training set for the SVM method,
in order to improve the quality of the model.
3. Create models for each runway that is CAT III certied. For this point
to be implemented, a substantial number of ights has to be available per runway.
A good candidate for this model would be Lisbon Airport (LPPT) RWY 21, from
which are the majority of the ights.
4. Create a model for the remaining les produced by the AGS procedure.
These are for Airbus A320 eet AU T O_AP P ROACH _320.CSV for the auto-
approaches according to the classication in Figure 2.7 and
REGU LARLAN D_320.CSV for all the manual landings operated, i.e., all the
remaining ights not classied either as Autoland or Autoapproach.
Despite this project evolved with LIBSVM for the Support Vector Machines imple-
mentation, there are other packages that should be taken into account and be part of
any future developments. An incomplete list of possible tools to use is provided next:
63
5.3 Final Considerations
In today's databases there is a large amount of data stored. This is the case of a typical
Flight Data Analysis system used in Airline's Flight Safety Departments to keep track
of any deviations from standard operation. These databases increase their size as the
processing of the data from the airline's operation occurs.
The traditional approach to the ight data analysis is by creating pre-determined
conditions, that once attained produce one Event, i.e., the report that was found one
condition beyond some established limit. This can be an over-speed, a hard-landing or
an unstabilized approach.
Having such an amount of data to analyse in a Flight Data Analysis system, data-
mining tools, such as those explored in this work, are extremely valuable to support
the analyst to draw conclusions from a large and high-dimensional set of data. The
Principal Component Analysis and the Support Vector Machines (SVM) proved to be
two robust methods for pattern recognition, and being more used everyday in dierent
areas of knowledge. These techniques have been explored in this project to evaluate
possible Abnormal cases among the set of data from the ights that was generated.
This thesis is a rst approach in TAP Portugal for using the capabilities of PCA
and SVM to the analysis of ights. It also opens the opportunity for future researches
and applications.
64
Appendix A
65
Table A.2 continued from previous page
Calculation Control Parameter(s) Procedure description
Max Brake Pressure BRAKE_Pn Appendix B.8
Time between TD and APx Disengage T, AP_EGDn Appendix B.8.1
Time between TD and AUTOLAND_ON T, AUTOLAND_ON Appendix B.8.1
Time TR Actuated T, REV_DEPLOYEDn Appendix B.8.1
Reverse Type N1n,T,REV_DEPLOYEDn Appendix B.8.2
Glide Slope Track FMA_GS_TRK, APMOD14 Appendix B.7
Landing Distance T, GS Appendix B.6
67
68
Appendix B
Additional Procedures
In this Appendix several procedures are presented, with results and/or information
that are necessary at some step during this project. The information in the following
sections supports the main procedure from this project presented in Apeendix C.
−→ −−→ −−→
GS = T AS + W S (B.1)
In this way, we can measure the Tail Wind or Head Wind, depending on the sign of
the WIND_COMP.
69
Figure B.1: Velocity Triangle
Equations B.1 to B.5 are valid if the aircraft is airborne. As soon as it touches the
ground it is mechanically connected to the ground and the drift angle is zero (δ = 0).
In this case Equation B.2 is simplied to:
W IN D_COM P = GS − T AS (B.6)
///////////////////////////////////////////////////////////////////////////
// WIND COMPONENT AND CROSS WIND
// =============================
// If aircraft on ground
//the TAS value only appears when GS>60knot
IF ((LDG_ON_1=1 OR LDG_ON_2=1 OR LDGN_ON=1) AND TAS # 0) THEN
70
WIN_COMP=GSC-TAS
WIN_CROSS=0
ENDIF
////////////////////////////////////////////////////////////////////////////////////////
The most direct parameters to determine the Touch-Down point are the Landing
Gear Squat-Switch signals, which are named for the A320 eet as LDG_ON_1 and
LDG_ON_2 for the Left and Right sides of the MLG (Table A.3). There is a delay
on the indication from these signals which makes this indication less accurate as others
that will be described in this section. For some applications, where the level of accuracy
is not necessary, this indications can be used.
In the case where a more precise denition of Touch-Down is necessary, such as the
landings for this report, these signals does not provide the precision needed. There are
several reasons why this happens. First, the sampling rate of those parameters is 1 Hz,
which limits to one second the granularity of the determination of the Touch-Down
point. Second the squat switch is pressed after the shock absorber from the landing
gear travels some distance when it is compressed during landing. At the time the
switch is activated, the Touch-Down point already occurred some time before. At last
it is not unusual that the Right LG and the Left LG from the MLG do not happen at
the same time, which introduces another factor for inaccuracy of this signal.
71
Another way to determine the Touch-Down, is from the normal and longitudinal
accelerations. These are presented in Figs B.2 and B.3. Studies in [37] and [38] indicate
that the point where the MLG touches the ground is when the normal acceleration
starts to raize for the st time (Fig B.2) or when the longitudinal acceleration suers
a peak down, more precisely in the beginning of this peak (Fig B.3). In the gures
it is represented the local maximum and minimum respectively. Both correspond to
reaction forces from the ground contact, which is the base of the assumption that the
Touch-Down has occurred before these local extremes.
1.3
VRTG [G]
1.25
1.2
1.15
1.1
1.05
0.95
0.9
0.8
0 20 40 60 80 100 120 140 160
0.15
0.05
−0.05
−0.1
−0.15
First Min Peak LONG
−0.2
0 20 40 60 80 100 120 140 160
Despite the Touch-Down can be visualised in the graphics of the accelerations, the
use of these parameters for its determination present some problems that make it
dicult to determine the Touch-Down out of the accelerations. The landing can be
soft, and the rst peak of normal acceleration can be easily confused with the peaks
that necessarily will appear after after the Touch-Down. Also, as stated above, this
peak corresponds to the ground reaction force, which occurs after the Touch-Down
itself. This makes dicult to use normal acceleration to determine Touch-Down for all
the ights processed in an automatic way.
To overcome the normal acceleration limitations, in [37], this quantity was derivated
creating a new parameter named Jerk . As discussed before, the dierentiation induces
72
noise, and a ltering algorithm has to be applied to reduce this behaviour (see section
B.3.1). Despite this is a valid method, it introduces additional calculations without the
warranty that it will become immune to noise. This is why it will not be considered in
this work for Touch-Down determination.
The longitudinal acceleration will be considered in the nal algorithm, as the nega-
tive peak is normally present even in soft landings. Nevertheless, it is not a standalone
process for determination.
In Figures B.4 and B.5 other possibilities of determining Touch-Down are explored.
The characteristic of the Radio Altimeters also used in Section B.6, state that Radio
Altitude is measuring the height between the ground and the MLG point closest to the
ground. From the calibration process during the installation of the Radio Altimeters
(RAs) and system conception, this is true provided θ < 6◦ which is normally the case.
This means that in the exact moment where the Aircraft (A/C) touches the ground,
there will be a Radio Altitude (RA) equal to zero (see Fig B.27).
The other parameters in Figs B.4 and B.5 are the squat switches and the speed
of the wheels from the MLG. From these images some interesting conclusions can be
taken:
• All the wheels from the MLG start rotating at the same time for this landing,
and this matches the time when the Left Hand (LH) and Right Hand (RH) squat
switches from the MLG are active.
• From Fig B.5 there are 6 samples separating the last RA that passes through zero
and the activation of the squat switches. This data is from the interpolated 8 Hz
le dened in Table ??, which means that there are 6/8seconds separating the
two measurements in the gure.
• Once the activation of the squat switch (Landing Gear ON) is recorded at 1 Hz,
the activation in the gures just state that at it assumes the value one, the MLG
touched the ground during the last second. From the previous item the Radio
Altitudes crossing are within the second delay from the squat switches.
• RA is a measurement from an electronic emission and reception. There are no
mechanical parts involved to produce any delay. Both Radio Altitude parameters
are sampled at 4 Hz, which multiplies by 4, the possibility of nding Touch-Down
within 1 second of sampling from MLG squat switches.
A nal comparison between the RA determination for Touch-Down, and the points
that could be found with VRTG, LONG and MLG Squat Switch are presented in Fig-
ures B.6, B.7 and B.8. The origin of these gures is the Touch-Down point determined
by the RA and each x division corresponds to 1/8 sec.
The MLG actuation occurs 7 samples after the RA (Fig B.8) and from Figures B.6
and B.7 the Touch-Down is recognised 3 samples after the RA, i.e., when the transition
starts. There is no way to know which is the real Touch-Down point, but the maximum
error that can appear is 3/8 second.
Considering the points above, the algorithm for Touch-Down determination will use
primarily the radio altitude zero crossing, which is the most acurate result studied,
and as backup both the longitudinal acceleration and squat switch activation in case
73
60
RALT1 [ft]
50 RALT2 [ft]
Wheel Speed (MLG)
Left and Right LG (MLG)
40
30
20
10
−10
0 5 10 15 20 25 30 35
25
20 RALT1 [ft]
RALT2 [ft]
Wheel Speed (MLG)
15 Left and Right LG (MLG)
10
−5
10 12 14 16 18 20 22 24 26
1.25
1.2
1.15
1.1
1.05
0.9
0.85
1 2 3 4 5 6 7 8 9 10 11
there is a failure on the radio altitude parameters. The code for this procedure in
AGS is listed on section B.2.1. This code produces a logic one value for the variable
TOUCH_DOWNC (which stands for Touch Down Corrected), whenever the Touch-
Down is determined. Any of the 3 methods can be used to this calculation. The
code also provides which was the method used for Touch-Down determination, by the
means of a ag identifying each condition (parameters FLAG_RA, FLAG_LG and
74
0.04
Longitudinal Acceleration [G]
0.03
0.02
0.01
−0.01
−0.02
−0.03
−0.04
−0.05
1 2 3 4 5 6 7 8 9 10 11
1
Right and Left LG (MLG)
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
1 2 3 4 5 6 7 8 9 10 11
FLAG_LAC).
///////////////////////////////////////////////////////////////////////////
// Procedure P606AP - Touch-Down Determination
// Created : Jun 2011
// Rate: 8 Hz
// Database: 10344
//
// Modification History:
// Modif Date By Details
// 16 16 JUN 11 HM Procedure created from P8352MP
// 17 20 Jun 11 HM RALT = 1st criteria / LND Gear and Long Accel
// backup criterias. Flags to determine which
// criteria was used for Touch-Down determination
75
IF ((LDG_ON_1(-0.125)=0 AND LDG_ON_1=1) OR (LDG_ON_2(-0.125)=0 AND LDG_ON_2=1) ) THEN
STEPTD=1
FLAG_LG =1
ELSE // Longitudinal Acceleration Criteria
IF ((LONG(-0.125)>=0 AND LONG<0)) THEN
STEPTD=1
FLAG_LAC =1
ELSE
STEPTD=0
ENDIF
ENDIF
ENDIF
ENDIF
ENDIF
////////////////////////////////////////////////////////////////////////////////////////
3500
2000
1500
1000
500
0
Touch Down
−500
0 500 1000 1500
76
The vertical velocity, is a parameter that is available in the cockpit as a primary
information for the ight. It is present on Airbus eets on a display known as PFD
which is available both for Capt and FO (Fig B.10).
200
−200
−400
−600
−800
−1000
−1400
0 500 1000 1500
Figure B.11: Baro-Inertial Vertical Velocity for a Landing in LPPT (RWY 03)
77
Touch-Down, which makes baro-inertial vertical velocity a questionable measurement
to determine at this point. These variations induced by the ground eect can be seen
in more detail in Fig B.12, displaying IVVR for a barometric altitude below 300ft.
−100
IVVR [ft/min]
−200
−300
−400
−500
−600
−700
Touch Down
−800
0 20 40 60 80 100 120
150
RALT_RATE (DMU)[ft/sec]
100
50
−50
−100
Touch Down
−150
0 500 1000 1500
The other option to get the sink rate value at Touch-Down, is through the Radio
Altitude Rate. As mentioned before the landing under consideration has this parameter
recorded, which is plotted in Fig B.13. This is an internal computation from the DMU
or acFDIMU (see Section 1.2 for system description).
Whenever a parameter such as Radio Altitude (or any other) is dierentiated, there
is a signicant amount of noise induced by the algorithm applied. As it is documented
in [37] and [38], after the dierentiation a smoothing lter has to be applied in order
to reduce the noise previously introduced.
Airbus Algorithm is documented in Aircraft Maintenance Manual (AMM) [39]. It
is based on a Kalman Filter technique and the output is plotted in Fig B.13. Some
comments on this graphic are the following:
• Radio Altitude Rate does not provide correct values of Vertical Speed when there
are ground irregularities below the ight path. Those are interpreted as climbs or
descends inappropriately
78
• From the previous item, just for at terrain the value of RALT Rate can be
interpreted as Sink Speed. In the ight under study in this section this happens
below a standard altitude of 300ft approximately, and the nal part of the graphic
can be used for the purpose of determining sink rate.
• Despite there is an amount of ltering, the variations on this parameter are sig-
nicant. Some comparisons with other ltering techniques will be studied.
To gure out the best way to get the sink rate, other parameters have been calcu-
lated:
dRALT D1(t)
RALT RC1 = (B.7)
dt
The value determined with B.7, is the direct dierentiation of Radio Altitude. As
referred before, any method used in this calculation induces noise that has to be cleaned
out. For this purpose there is a function in AGSTM , named F_NOISE() that will be
applied to parameter RALTRC1, resulting RALTRC2. A second method of noise
reduction will be implemented, known as Gaussian Smoothing. Studies with the use
of this method can be found in [40], and a brief explanation is in B.3.1.
According to AGSTM documentation, the function F_NOISE() is dened as in Equa-
tion B.8, where N RV is the input parameter and F O is the output, corresponding to
the smoothed parameter.
1
F O = OF V + · (N RV − OF V ) (B.8)
3
Where:
• FO → Filtered Output
• OFV → Old Filtered Value
• NRV → New Raw Value
Both methods are applied to RALTRC1 parameter resulting in the signals presented
in Figs B.14 and B.15.
At this stage, there are three methods for ltering that can be selected. These are:
A comparison between these methods are presented in Fig B.16 for the whole landing
under study, and Fig B.17 for an altitude standard lower than 200 ft (ALT_STD <
79
300
RALTRC1 [ft/s] (AGS)
RALTRC2 [ft/s] (AGS)
200
100
−100
−200
−300
Touch Down
−400
0 500 1000 1500
300
RALTRC1 (AGS) [ft/s]
GAUSS(RALTRC1) [ft/s]
200
100
−100
−200
−300
Touch Down
−400
0 500 1000 1500
100
RALTR [ft/s] (DMU)
−100
0 200 400 600 800 1000 1200
100
RALTRC2 [ft/s]
−100
0 200 400 600 800 1000 1200
100
GAUSS(RALTRC1) [ft/s]
−100
0 200 400 600 800 1000 1200
200ft). These gures help with the conclusion for determining the best method for sink
rate determination.
As a rst conclusion, the RALTR (DMU) despite being calculated on the aircraft
has some inconvenient to be considered. It keeps a large amount of high frequencies
80
2
RALTRC2 [ft/s]
GAUSS(RALTRC1) [ft/s]
0
RALTR [ft/s] (DMU)
−2
−4
−6
−8
−10
−12
−16
0 10 20 30 40 50 60
variations and the it is recorded at a 1 Hz sampling rate compared to the other two
determinations that are based in Radio altitude sampled at 4 Hz. From this last
limitation, one can see that the value from RALTR (DMU) is zero, when Touch-Down
is determined, i.e., the other calculations have 4 values calculated within each one
available from the DMU.
The gures B.16 and B.17 also show that RALTRC2 based on F_NOISE() function
from AGS applied to the Radio Altitude dierential, has a similar smoothing capability
than the previous method. It can though be used when Touch-Down is determined,
despite the variations of RALTRC2 are high in the neighbourhood of this point.
The best method to be considered is the Gaussian Smoothing which presents a
soft line, going along with the trend of the other two methods and in the vicinity of
Touch-Down there is not a great amount of signal variation.
Z
ξ(t0 ) = x(t) · g(t − t0 )dt (B.9)
This Equation is used for continuous parameters. For sampled parameters (discrete)
the following equation applies (Eq B.10) in a vicinity ∆T of each point t0 calculated.
X
ξ(t0 ) = x(t) · g(t − t0 ) (B.10)
The function g(t − t0 ) is the weight assigned to each new calculation. This function
is chosen in a way that the closest t is to t0 , the highest is the weight calculated. The
81
Gaussian bell shaped curve is the suited function for g(t − t0 ), as after deciding which
is the σ to be used, the values decrease weight as the distance to the current sample
increases.
In this way, g(t − t0 ) is dened as:
(t−t0 )2
g(t − t0 ) = N · e− 2.σ 2 (B.11)
g(t − t0 ) shall be normalised, i.e., the value of N ) in Equation B.11 will be the one
that makes the sum on all calculated points equal to one.
The standard deviation, σ , corresponds to the lter size. The largest this value, the
more points contribute to the calculation and the lter will be less sensitive to signal
uctuations. This behaviour is well exemplied in gures B.18,B.19 and B.20. The
peak value has inuence on the ltered output for σ = 10, due to the fact that the
bell curve is more open for this value and the peak value is considered on more distant
points of the calculation. Any choice which is made for σ , performs the weighting of
the function with past and future values. This is a Non-Causal Filter.
A compromise shall be established for deciding which σ to use for calculations on
this section. Both values of σ = 2 and σ = 4 provide good results for the trend line.
As an option, it is used σ = 4 for the calculations in this section, due to the results
provided with this value and presented in this section.
Another compromise is obtained from testing the results which are presented in the
graphics of this section. the vicinity of the convolution presented in Equation B.10, has
to be assigned. The rst code produced in Matlab included all the samples presented
in a vector. This is a large amount of calculations, and despite it produced results, it
took a long processing time to nish. From the nature of the bell curve itself, we know
that the side values from both sides of the curve will contribute with a negligible value
to the convolution of Eq B.10.
Testing the values the conclusion was that 10 samples each side could be used with
σ = 4, providing similar results as the total amount of samples. For this reason a total
of 20 samples (±10) is being used in code presented in B.3.2.
82
Figure B.19: Gaussian Smooth - Example with σ = 4
Considering the ight under study in this section, the comparison of the ltered
parameters with the original diferential of radio altitude without any ltering (RAL-
TRC1) can be seen in Figs B.21 and B.22. These gures indicate that the Gaussian
smoothing algorithm is the one best suited for the purpose of removing the noise, which
is presented as high frequencies variations in the gures.
20
10
−10
−20
−30
−40
−50
RALTRC1 [ft/s]
−60 GAUSS(RALTRC1) [ft/s]
Touch Down
RALTRC2 [ft/s]
−70
0 20 40 60 80 100 120
83
5
RALTRC1 [ft/s]
GAUSS(RALTRC1) [ft/s]
RALTRC2 [ft/s]
0
−5
−10
−15
Touch Down
−20
0 10 20 30 40 50 60
%
% Gaussian Smoothing of RALTRC1 = dRALTD1/dt
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
N = size(raltrc1);
for i=1:N(1),
t(i) = i;
end;
sigma = 4;
for j= 11:(N(1)-10),
t0=t(j);
soma(j)=0;
sumgauss=0;
k = j -10;
for i= 1:20,
sumgauss = sumgauss + exp((-1)*((t0-t(k))^2)/(2*sigma^2));
k = k + 1;
end;
%
k = j -10;
for i= 1:20,
weight = exp((-1)*((t0-t(k))^2)/(2*sigma^2)) / sumgauss;
soma(j)=soma(j)+raltrc1(k)*weight;
k = k + 1;
end;
end;
% First 10 values of the vector are not filtered -> replace the zeros with original values
%
soma(1:10) = raltrc1(1:10);
%
% End of Gaussian Smoothing
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
84
B.4 Flaps/Slats Conguration
Flaps and Slats are the mechanisms installed on aircraft to produce lift augmentation.
These are surfaces installed on the trailing and leading edges of the wing respectively.
Whenever there is one actuation of the Flaps/Slats Lever (Fig. B.23) some amount of
surfaces extension is produced. Each detent position leads to the angles described in
Table B.1.
There is a speed protection associated with the position #1 of the lever position,
which allows Flaps to extend to 10 ◦ when an Indicated Airspeed is below 210 Knots
in order to protect the surface from structural damage. If the speed drops below 100
Knots teh aps extend automatically to 10 ◦ , i.e., position 1+F, in order that lift can
be guaranteed despite the low speed of the aircraft.
85
pressing an instinctive disconnect button (red button in Fig. B.24). This signal is
recorded as ATHID (Table A.4).
There is a correspondence between the Thrust Lever angle measured and its detent
position. This is important to determine, as in most of the situations the detent position
of the throttle is more relevant than the angle itself. This deciding is described in Fig
B.25 and Table B.2.
Table B.2: Angle to Position Conversion
86
B.6 Threshold Crossing
The method used to determine the Threshold crossing is described in [37] (pag. 13),
and is valid whenever the approach runway is equipped with an ILS, from where a sta-
ble glide slope signal is measured. The geometry represented in Fig B.26, DT H is the
distance on the runway between the glide-slope antenna and the point corresponding
to the Threshold of 50ft, i.e., DT H = 50/ tan(3◦ ) = 954f t.
The calculations in this section assume that Radio Altitude is measuring the height
between the ground and the MLG point closest to the ground. This is true provided
θ < 6◦ due to the calibration process of the Radio Altimeter during its installation.
Also, the reception point of the Glide-Slope signal is located inside the radome of the
aircraft where the antenna is installed. Having Fig B.27 based on a CAD drawings for
each Airbus eet, LGS and VGS can be determined. The values are presented on Table
B.3.
From Fig B.27 and Table B.3, we have the relations:
87
Table B.3: Aircraft Dimensions
A319 A320 A321 A330 A340
[ft] [m] [ft] [m] [ft] [m] [ft] [m] [ft] [m]
49.71 15.15 54.95 16.75 68.90 21 90.88 27.70 102.17 31.14
9.94 3.03 9.77 2.98 9.84 3.00 17.22 5.25 17.32 5.28
LGS
VGS
Radio Altitude
Height of Glide Slope Antenna
hr
Speed Vector
∆GS
Threshold
v
The geometries presented allow to determine when the aircraft passed over the
threshold point provided there is a valid ILS signal. Equations B.13 and B.14 perform
this calculation assuming that there is a 3◦ slope from the ILS. This is the most common
situation.
50
DT H = = 954f t (B.13)
tan(3◦ )
hGS
DGS = (B.14)
tan(∆GS + 3◦ )
All the quantities in this section are calculated in AGSTM for the development of this
work.
///////////////////////////////////////////////////////////////////////////
// Procedure P631AP - Threshold Crossing Computation
// Created : Sept 2010
// Rate: 8 Hz
// Database: 10344
88
//
// Modification History:
// Modif Date By Details
// 29 29 Jun 2011 HM Threshold Crossing Code for A320fam fleet
//
//
//
// Determinação do Treshold
ENDIF
// Treshold crossing
//
IF (DGS(-0.125)>=DTH AND DGS<DTH AND TRESH=0) THEN
TRESH = 1
ENDIF
//////////////////////////////////////////////////////////////////////////////////////
89
Once both LOC and G/S are captured, the autopilot enters another mode called
TRACK, and there is on the FMA the presentation of both LOC TRK and GS TRK.
When an altitude of ≈ 400ft AGL is attained, LOC TRK and GS TRK merge with
each other, and a new mode named LAND TRK appears on the FMA. This logic can
be derived both from the avionic unit that displays this information on the cockpit
displays, the DMC, or from the source of this data, the FGC (Part of the FMGC on
A320).
• FMA_LG_TRK
90
performed are presented.
∆ T Description
∆T_LT Time between Landing Track and Touch-Down
∆T_100 Time between RALT = 100ft and Touch-Down
∆T_50 Time between RALT = 50ft and Touch-Down
∆T_30 Time between RALT = 30ft and Touch-Down
∆T_FL Time between Flare Mode and Touch-Down
∆T_AL Time between Allign Mode and Touch-Down
∆T_TH Time between Treshold Cross and Touch-Down
∆T_LG Time between Main Landing Gear and Nose LG Ground Touch
• IDLE - If during Reverse the maximum power applied to the Engines is below
50% of N1
• FULL - If during Reverse the maximum power applied to the Engines is above
50% of N1 for at least 5 seconds
• NO - No Reverse applied
As presented in B.5, the transit between Idle and Full Reverse throttle positions
makes the engine power increase. In the Reverse situation a backward movement of
the throttle produces the increase of the power.
Other than this determination, there is counter (THR_COUNT), that determines
the number of seconds that reverse is applied.
///////////////////////////////////////////////////////////////////////////
// Reverse Type and duration determination
//
IF (FM_FWC = 8 OR FM_FWC = 9) THEN
IF ((REV_DEPLOYED1 = 1) AND (REV_DEPLOYED2 = 1)) THEN
MAX(N11C, N11_MAX)
MAX(N12C, N12_MAX)
IF ((N11_MAX < 50) AND (N12_MAX < 50)) THEN
91
TR = 'IDLE'
TRN = 1
THRUST_REV = 10
ENDIF
IF ((N11_MAX >= 50) AND (N12_MAX >= 50)) DURING 5 THEN
TR = 'FULL'
TRN = 2
THRUST_REV = 20
ELSE
TR = 'IDLE'
TRN = 1
THRUST_REV = 10
ENDIF
THR_COUNT = THR_COUNT + 1
ENDIF
ENDIF
IF (THR_COUNT = 0) THEN
TR = 'NO'
TRN = 0
ENDIF
//////////////////////////////////////////////////////////////////////////////////////
92
Appendix C
93
Appendix D
94
Appendix E
According to LIBSVM authors [36], the advised proper use of this tool is summarized
on the following items:
95
• Convert the Data to LIBSVM format
• Scaling the Data
• Use The RBF Kernel Function
• Cross Validation to determine the best C and γ
• Training to create the model
• Test the model to determine new predictions
These items are taken into consideration for the model construction and data testing
on the following sections.
In the case under study, the input space X ∈ Rm is a space with dimension m = 42.
An extra information will be added corresponding to the label Y ≡ {±1}, which will
assign a classication to each vector. This step will require that a criteria is established
and this is of major importance, as future results from the SVM implementation derive
directly from the reliability of this labelling.
The rst problem to create a model with LIBSVM that has to be addressed, is the
matching between the output le from the AGS procedure (Appendix C) and the input
to LIBSVM. In order to solve it, a sequence of data manipulation programs have been
implemented. This process is described in Figure E.2.
In the case of Airbus A320 eet, the AGS procedure creates the CSV le named AU-
TOLAND_320RAW.CSV which is converted to AUTOLAND_320.CSV if a double-
comma is found, meaning that there was some data that was not exported during the
analysis process. This can happen if the landing runway is not found in the runway
or there are any errors during this process. The le at this stage corresponds to the
complete le with 270 features described in Section 2.3.4. Then it is converted to the
42 feature autoland2svm.csv using the program auto2libsvm.c (Section ??).
The format conversion from CSV to LIBSVM is performed with the convert.c pro-
gram (Section E.1), resulting the le autoland2svm. This le is ready to be used by
96
LIBSVM, but there is still the optional step during the model creation to split this
le in two for the matter of checking the accuracy of the model. Provided the labels
from le autoland2svm are known, part of this le creates the le autoland2svm.tr for
training the model, and the remaining are the contents of le autoland2svm.te used for
testing the model. This method will be used in Section E.4.
97
Table E.1: Criteria to determine the SVM Training File
The second option tested (Criteria #2), was performed keeping the ±2σ but just to
Flare, Touch-Down keypoints and the Ground calculations. There is one improvement
on the number of cases with the Abnormal classication still in greater number than the
Normal which turns out to be one handicap, but the major drawback of this approach
is that it has the same problem as using the Poor Performance criteria, which is relying
just on keypoints close to the ground.
The solution from Criteria #3, seemed to be the most appropriate, as it produces
the analysis on all the 42 features being the values that are out of ±3σ from the average
are considered as Abnormal. The le produced with this criteria will be used in the
remaining sections of this chapter.
#_Correctly _P redicted_Data
Accuracy = · 100% (E.1)
#_T otal_T esting _Data
In this way a training le with 320 vectors and a testing le with the remaining
34 vectors, named respectively autoland2svm_3sigma.tr and autoland2svm_3sigma.te
will be created from the original le.
98
The rst command scales all the features into a ±1 range, stores this range to be
used in the second command and saves the result into autoland2svm_3sigma.tr.scale
to create the training scaled le. The second command reads the range and creates the
scaled testing le.
99
Being a trial-and-error method, it is convenient that a large amount of trials are
produced at this stage, to follow the guidelines of LIBSVM's auhors [36], so that a
reliable model is found to be used with novelty data.
The tests performed to improve CV-Accuracy are summarized in Table E.2. Test
#1 corresponds to the default values of grid.py which are the starting point to the
successive improvements. Despite other combinations have been tried, none revealed
better results than Test #9 here presented.
Table E.2: Summary of Tests Performed to improve CV-Accuracy
During the running of each test, a contour plot representing the lines of equal CV-
Accuracy is created. In Figures E.4 and E.5 are presented the contour plots for Test
#1 and #9 respectively.
The accuracy determined provides the reliability of the model. In this case, it is
expected that the degree of accuracy in the determination of labels, is not lower than
the CV-Accuracy from Table E.2 , for each pair (C, γ) determined.
100
Figure E.5: Contour Plot - Best (C, γ) determined
svm_type c_svc
kernel_type rbf
gamma 0.594604
nr_class 2
total_sv 202
rho 0.573805
label 1 -1
nr_sv 118 84
SV
0.3100271947396164 1:0.0921502 2:-0.712482 3:0.319435 4:0.112164 5:0.548387 ...(till feature 42:)
0.9949895865478108 1:-0.273038 2:-0.630435 3:0.39174 4:-0.109005 5:0.419355 6:-0.386364 ....
(....)
-1.277788247286919 1:-3.33067e-016 2:-0.520337 3:0.278269 4:-0.222749 5:0.548387 7:0.232323 ....
101
-0.7381174596955521 1:0.180887 2:-0.575035 3:0.267977 4:0.00157978 5:0.419355 6:-0.386364 .....
E.4.4 Prediction
The model le determined in the previous step can be now tested with the 34 vectors
that have been separated from the initial training set of 354. The command line is:
This script acts together with grid.py, and the executables of svm-scale, svm.train
and svm-predict from LIBSVM . The grid search was changed according to the input
values from Table E.2 to produce a ner scan than the default values.
102
Bibliography
[1] European Aviation Safety Agency, Ocial Journal of European Union. EASA, 29 Sep 2008.
[2] The Evolution of Flight Data Analysis, Neil A.H. Campbell, http://ebookbrowse.com/the-evolution-of-
ight-data-analysis-neil-campbell-pdf-d45049768.
[3] BEA - Bureau d'Enquetes et d'Analyses, Flight Data Recorder Read-Out Technical and Regulatory
Aspects. BEA, Paris, (www.bea.fr) May 2005.
[4] SFIM / SAGEM - Safran Group, Flight and Crash Recording Systems for Airliners. SFIM, Paris,
(www.sagem-ds.com) Doc No CDC/SFIM 70.102
[5] Capt. Carlos Nunes, Flight Data Monitoring, TAP Air Portugal - 39 Years Experience. The First European
Flight Data Conference Lisbon, Portugal, SEP 2001.
[6] EUROCAE, ED112 - Minimum Operational Performance Specication for Crash Protected Airborne
Recorder System. Paris, France, MAR 2003.
[7] ARINC, ARINC SPECIFICATION 429P2-16. MARK 33 DIGITAL INFORMATION TRANSFER SYS-
TEM (DITS), AERONAUTICAL RADIO, INC. 2551 RIVA ROAD ANNAPOLIS, MARYLAND, 21401-
7465 USA December 17, 2004.
[8] ARINC, ARINC CHARACTERISTIC 717-15. FLIGHT DATA ACQUISITION AND RECORDING SYS-
TEM, AERONAUTICAL RADIO, INC. 2551 RIVA ROAD ANNAPOLIS, MARYLAND, 21401-7465 USA
June 06, 2011.
[9] Helder Mendes, Wireless Flight Data Transmission. TAP Portugal, Flight Safety Bulletim (No 77) Lisbon,
Portugal, 2010.
[10] SAGEM -Safran Group, Analysis Ground Station V12 Users Manual. Paris, France, 2008.
[11] Peter Waller,Collin J. Tate, L. Snellgrove, N. Gould Tristar Autoight Analysis. British Airways, London,
UK, 1981.
[12] Flight Data Co, GRAF Autoight Analysis Specication - A320. Flight Data Company, London, UK,
1994.
[13] Flight Data Co, GRAF Autoight Analysis Specication - A340. Flight Data Company, London, UK,
1996.
[14] Flight Data Co, GRAF 5 Autoight Analysis Program. Flight Data Company, London, UK, 1989.
[15] Santanu Das, Bryan L. Mathews, Ashok N. Srivastava, Multiple Kernel Learning for Heterogeneous
Anomaly Detection: Algorithm and Aviation Safety Case Study. NASA Ames Research Center, Moett
Field, CA, 2010.
[16] Edward Smart, Honghai Lu, Chis Jesse and David Brown, Qualitative classication of descent phases in
commercial ight data. Int. J. Computational Intelligence Studies, Vol I, No 1, 2009.
[17] Edward Smart, Detecting Abnormalities in Aircraft Flight Data and Ranking their Impact on the Flight
. Institute of Industrial Research, University of Portsmounth, 20th April 2011.
[18] Paul Hayton, Bernhard Schölkopf, Lionel Tarassenko, Paul Anuzis, Support Vector Novelty Detection
Applied to Jet Engine. 14th European Aviation Safety Seminar, Budapest Hungary, March 11-13, 2002.
103
[19] Paul Hayton, Simukai Utete, Dennis King, Paul Anuzis, Lionel Tarassenko, Static and dynamic novelty
detection methods for jet engine health monitoring. Philisophical Transactions of the Royal Society, 12
December 2006.
[20] Airbus Industries, Getting to Grips with CAT II/ CAT III Operations. Issue 3 Toulousse/Blagnac, France,
2001.
[21] Herbert Taub, Digital Circuits and Microprocessors. McGraw-Hill, 1985.
[22] Airbus Industries, Flight Crew Operations Manual Airbus A320. Toulousse/Blagnac, France, 2011.
[23] Martin Bilodeau, David Brenner, Theory of Multivariate Statistics. Springer Texts in Statistics , Springer-
Verlag 1999.
[24] Wolfgang Härdle, Léopold Simar, Applied Multivariate Statistical Analysis. Second Edition, Springer,
2007.
[25] Brian Everitt, Torsten Hothorn, An Introduction to Applied Multivariate Analysis with R. Springer
Science-Business Media, 2011.
[26] K.R. Gabriel, The biplot graphic display of matrices with applications to principal component analysis.
Biometriks, 1971.
[27] Michael Greenacre, Biplots in Practice. (www.fbbva.es), Fundación BBVA, 2010.
[28] Suratna Budalakoti, Ashok N. Srivastava, Matthew E. Otey Anomaly Detection and Diagnosis Algorithms
for Discrete Symbol Sequences with Applications to Airline Safety. IEEE Transactions on Systems Man
and Cybernetics, August 28, 2008.
[29] Colin Campbell and Yiming Ying, Learning With Support Vector Machines. Morgan & Claypool Pub-
lishers, 2011.
[30] Brian C. Lovell and Christian J. Walder, Support Vector Machines for Business Applications. The Uni-
versity of Queensland and Max Plank Institute, Tubingen .
[31] Ana Carolina Lorena e André C.P.L. de Carvalho, Uma Introdução às Support Vector Machines. USP -
São Carlos, Brasil, RITA, Volume XIV, Número 2, 2007.
[32] Chih-Chung Chang and Chih-Jen Lin, LIBSVM : a library for support vector machines. De-
partment of Computer Science, National Taiwan University, Taipei 106, Taiwan. Available at
http://www.csie.ntu.edu.tw/ cjlin/libsvm. May 20, 2011.
[33] Andrew Ng, CS229 Machine Learning. http://cs229.stanford.edu, Stanford University, USA, Autumn
2011.
[34] Francisco Fabian de Macedo Almeida, Support Vector Machine. Universidade Federal de Campina
Grande, Brasil, Junho, 2007.
[35] Fundação Calouste Gulbenkian, Optimização Linear e Não Linear. L. Valadares Tavares, F. Nunes Cor-
reia, Lisboa, Portugal, Julho, 1986.
[36] Chih-Wei Hsu, Chih-Chung Chang and Chih-Jen Lin, A Practical Guide to Support Vector Classication.
Department of Computer Science, National Taiwan University, Taipei 106, Taiwan. Available at
http://www.csie.ntu.edu.tw/ cjlin/libsvm. April 15, 2010.
[37] DOT/FAA/AR-07/7, A Study of Normal Operational Landing Performance on Subsonic, Civil,
Narrow-Body Jet Aircraft During Instrument Landing Approaches. DOT/FAA, Washington, DC, 2007.
[38] John W. Rustenburg, Daniel O. Tipps, Donald Skinn A Comparison of Landing Parameters from Manual
and Automatic Landings of Airbus A320 Aircraft. UDRI - University of Dayton Research Institute, Day-
ton, OH, 2001.
[39] Airbus Industries, Aircraft Maintenance Manual - Airbus A320. Toulousse/Blagnac, France, 2011.
[40] André Simões, Programming/Decoding of ight data from an Airline's A320-200 Fleet. IST, Lisboa,
Portugal, 2010.
104