mexca.data
Objects for storing multimodal data.
Module Contents
Classes
Video annotation class for storing facial features. |
|
Class for storing voice features. |
|
Class for storing speech segment data. |
|
Class for storing speaker and speech segment annotations. |
|
Class for storing transcription data. |
|
Class for storing audio transcriptions. |
|
Class for storing sentiment data. |
|
Class for storing sentiment scores of transcribed sentences. |
|
Class for storing multimodal features. |
- class mexca.data.VideoAnnotation[source]
Video annotation class for storing facial features.
- Parameters:
frame (list, optional) – Index of each frame.
time (list, optional) – Timestamp of each frame in seconds.
face_box (list, optional) – Bounding box of a detected face. Is numpy.nan if no face was detected.
face_prob (list, optional) – Probability of a detected face. Is numpy.nan if no face was detected.
face_landmarks (list, optional) – Facial landmarks of a detected face. Is numpy.nan if no face was detected.
face_aus (list, optional) – Facial action unit activations of a detected face. Is numpy.nan if no face was detected.
face_label (list, optional) – Label of a detected face. Is numpy.nan if no face was detected.
face_confidence (list, optional) – Confidence of the face_label assignment. Is numpy.nan if no face was detected or only one face label was assigned.
- class mexca.data.VoiceFeatures[source]
Class for storing voice features.
Features are stored as lists (like columns of a data frame). Optional features are initialized as empty lists.
- Parameters:
- class mexca.data.SpeakerAnnotation(intervals: List[intervaltree.Interval] = None)[source]
Bases:
intervaltree.IntervalTreeClass for storing speaker and speech segment annotations.
Stores speech segments as
intervaltree.Intervalin anintervaltree.IntervalTree. Speaker labels are stored in SegmentData objects in the data attribute of each interval.- classmethod from_pyannote(annotation: Any)[source]
Create a SpeakerAnnotation object from a
pyannote.core.Annotationobject.- Parameters:
annotation (pyannote.core.Annotation) – Annotation object containing speech segments and speaker labels.
- class mexca.data.AudioTranscription(filename: str, subtitles: Optional[intervaltree.IntervalTree] = None)[source]
Class for storing audio transcriptions.
- Parameters:
filename (str) – Name of the transcribed audio file.
subtitles (intervaltree.IntervalTree, optional, default=None) – Interval tree containing the transcribed speech segments split into sentences as intervals. The transcribed sentences are stored in the data attribute of each interval.
- class mexca.data.SentimentAnnotation(intervals: List[intervaltree.Interval] = None)[source]
Bases:
intervaltree.IntervalTreeClass for storing sentiment scores of transcribed sentences.
Stores sentiment scores as intervals in an interval tree. The scores are stored in the data attribute of each interval.
- class mexca.data.Multimodal(filename: str, duration: Optional[float] = None, fps: Optional[int] = None, fps_adjusted: Optional[int] = None, video_annotation: Optional[VideoAnnotation] = None, audio_annotation: Optional[SpeakerAnnotation] = None, voice_features: Optional[VoiceFeatures] = None, transcription: Optional[AudioTranscription] = None, sentiment: Optional[SentimentAnnotation] = None, features: Optional[pandas.DataFrame] = None)[source]
Class for storing multimodal features.
See the Output section for details.
- Parameters:
filename (str) – Name of the file from which features were extracted.
duration (float, optional, default=None) – Video duration in seconds.
fps (: float) – Frames per second.
fps_adjusted (float) – Frames per seconds adjusted for skipped frames. Mostly needed for internal computations.
video_annotation (VideoAnnotation) – Object containing facial features.
audio_annotation (SpeakerAnnotation) – Object containing speech segments and speakers.
voice_features (VoiceFeatures) – Object containing voice features.
transcription (AudioTranscription) – Object containing transcribed speech segments split into sentences.
sentiment (SentimentAnnotation) – Object containing sentiment scores for transcribed sentences.
features (pandas.DataFrame) – Merged features.
- merge_features() pandas.DataFrame[source]
Merge multimodal features from pipeline components into a common data frame.
Transforms and merges the available output stored in the Multimodal object based on the ‘frame’ variable. Stores the merged features as a pandas.DataFrame in the features attribute.
- Returns:
Merged multimodal features.
- Return type:
pandas.DataFrame