Salesforce

Cognitive engine capabilities

« Go Back
Information
Cognitive engine capabilities
000003959
Public
Product Selection
aiWare - aiWare
Article Details

Within each engine class is a set of capabilities, which are based on what type of data they output.

ClassCapabilityDescription
AudioAudio fingerprintingRecognizes a specific audio segment, such as a radio advertisement, as it appears in a longer audio file or on its own.
BiometricsFace detectionDetects the presence of one or multiple faces in an image or video.
BiometricsFace recognitionIdentifies one or multiple people in an image or video by associating each individual's face to their name.
BiometricsSpeaker verificationDetermines the similarity between the speaker's voice in an audio file to the voice of a person with a specified username. In enroll mode, the engine enrolls the speaker's voice into the library under the username.
DataCorrelationAssociates two data products based on some commonality, such as occurrence over time. For example, may associate weather data on a given date with stock prices on that date.
DataGeolocationIdentifies the geographic location of a person or object in the real world or some virtual equivalent.
DataBrand safetyProcesses media to determine where content falls on a scale of sensitivity or concern.
Facial featuresFacial featuresComputes metrics pertaining to face movement using a series of face landmarks and audio.
SpeechSpeaker detectionaka Speaker Separation, Diarization. Partitions an input audio stream into segments according to who is speaking when.
SpeechSpeaker recognitionaka Speaker Identification. Identifies speakers in an audio file based on trained recordings of their voice.
SpeechTranscriptionConverts speech audio to text.
TextAnomaly detectionAssigns a value to each item in a time-series according to how anomalous the object is.
TextContent classificationCategorizes one or multiple documents according to a pre-defined ontology.
TextEntity extractionaka Named-entity recognition. Classifies named entities located in unstructured text into pre-defined categories such as people, organizations and locations.
TextKeyword extractionIdentifies key terms and/or phrases that appear in documents, based on parts of speech, salience, or other criteria.
TextLanguage identificationDetects one or multiple natural languages in text.
TextSentiment analysisClassifies text according to sentiment. May include a score representing negative, neutral or positive, or include a wider breadth of tags such as "happy" or "excited."
TextSummarizationGenerates a summary of written text.
TextText extractionExtract textual information from documents, and expresses that extracted text in a structured format.
TextTranslationTranslates natural language from a text source. Includes translating plain text, rich text, extracted text, recognized text (OCR), and transcripts.
VerificationFace verificationDetermines the similarity between the face in an image to the face of a specified username. In enroll mode, the engine enrolls the face image into the library under the username.
VerificationSpeaker verificationDetermines the similarity between the speaker's voice in an audio file to the voice of a person with a specified username. In enroll mode, the engine enrolls the speaker's voice into the library under the username.
VisionImage classificationClassifies the entire image rather than objects within an image, such as "landscape" or "basketball game."
VisionLicense plate recognition (ALPR)Produces a text string of alphanumeric characters for each license plate recognized in an image or video.
VisionLogo detectionRecognizes one or more logos or branding elements in an image or video.
VisionObject detectionDetects one or multiple objects or concepts in an image or video from a general/broad ontology, such as "car" or "person."
VisionText recognitionaka Optical Character Recognition. Converts alphanumeric characters to text in a document, image, or video.
Additional Technical Documentation Information
Properties
1/10/2024 11:30 PM
1/16/2024 9:51 PM
1/16/2024 9:51 PM
Documentation
Documentation
000003959
Translation Information
English

Powered by