Review
Copyright ©The Author(s) 2023.
World J Psychiatry. Jan 19, 2023; 13(1): 1-14
Published online Jan 19, 2023. doi: 10.5498/wjp.v13.i1.1
Figure 1 Emotion indicators in the patient-doctor interaction.
Figure 2 Spectrograms of the Persian word "sahar" pronounced by a native female speaker of Persian in the neutral (top) and anger (bottom) conditions. The spectrograms illustrate several differences between the acoustic representations of the two utterances. For example, the mean fundamental frequency is higher in the anger condition (225 Hz) than in the neutral condition (200 Hz). Likewise, acoustic features such as the mean formant frequencies (F1, F2, F3, and F4), the minimum and maximum of the fundamental frequency, and the mean intensity are all lower in the neutral condition. More details are provided in Table 1.
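The acoustic measures referred to above (fundamental frequency, formant frequencies, and intensity) can be extracted automatically from a recording. The following minimal sketch uses the praat-parselmouth library, which is an assumption rather than a tool prescribed by the article, and a hypothetical file name sahar_neutral.wav; running it on the neutral and anger recordings and comparing the printed values would reproduce the kind of contrast shown in the figure.

```python
# Minimal sketch: extracting the acoustic features discussed above
# (mean/min/max F0, mean F1-F4, mean intensity) from one utterance.
# Assumes the praat-parselmouth package; "sahar_neutral.wav" is a
# hypothetical file name, not part of the original article.
import numpy as np
import parselmouth

snd = parselmouth.Sound("sahar_neutral.wav")

# Fundamental frequency (F0): keep only voiced frames (non-zero values)
pitch = snd.to_pitch()
f0 = pitch.selected_array["frequency"]
f0 = f0[f0 > 0]
print(f"mean F0: {f0.mean():.0f} Hz, min: {f0.min():.0f} Hz, max: {f0.max():.0f} Hz")

# Mean intensity in dB
intensity = snd.to_intensity()
print(f"mean intensity: {intensity.values.mean():.1f} dB")

# Mean formant frequencies F1-F4 (Burg method), averaged over all analysis frames
formants = snd.to_formant_burg()
times = formants.xs()
for n in range(1, 5):
    vals = np.array([formants.get_value_at_time(n, t) for t in times])
    vals = vals[~np.isnan(vals)]  # skip frames where no formant was found
    print(f"mean F{n}: {vals.mean():.0f} Hz")
```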
Figure 3 Integrated platform for patient emotion recognition and decision support. The platform consists of a data-gathering component and intelligent processing engines. Each patient's data, in the form of voice recordings and transcripts, are captured, labeled, and stored in a dataset. The resulting dataset feeds the machine learning training, validation, and test engines. The entire intelligent processing cycle may be iterated several times for further fine-tuning. Collaboration among the three relevant areas of expertise is crucial across the different parts of the proposed solution.
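To make the training, validation, and test stage of the platform concrete, a minimal sketch follows. It uses scikit-learn, which is an assumption (the article does not name a framework), and it assumes a feature matrix X of per-utterance acoustic features (such as the measures in Table 1) with emotion labels y; the classifier, split ratios, and hyperparameter loop are illustrative choices only.

```python
# Minimal sketch of the training / validation / test stage of Figure 3.
# Assumes scikit-learn; X stands for per-utterance acoustic features
# (e.g. the measures in Table 1) and y for the emotion labels.
# The random placeholder data, classifier, and split ratios are
# illustrative, not the article's prescription.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 8))        # placeholder feature matrix
y = rng.integers(0, 2, size=200)     # placeholder labels (0 = neutral, 1 = anger)

# Hold out a test set, then carve a validation set out of the remainder
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X_train, y_train, test_size=0.25, random_state=0)

# Iterative fine-tuning: pick the hyperparameter that scores best on validation data
best_model, best_score = None, -1.0
for n_trees in (50, 100, 200):
    model = RandomForestClassifier(n_estimators=n_trees, random_state=0).fit(X_train, y_train)
    score = accuracy_score(y_val, model.predict(X_val))
    if score > best_score:
        best_model, best_score = model, score

# The untouched test set is evaluated only once, at the end
print("test accuracy:", accuracy_score(y_test, best_model.predict(X_test)))
```

The same structure (labeled dataset, separate training/validation/test splits, and repeated fine-tuning before a single final test) mirrors the iterative intelligent-processing loop described in the caption.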