SEMAINE-3.0: The full Sensitive Artificial Listener system

Released 24 September 2010.

The aim of the SEMAINE project is to build a Sensitive Artificial Listener (SAL) – a multimodal dialogue system with the social interaction skills needed for a sustained conversation with a human user. SEMAINE-3.0 is a full implementation of a SAL. This video illustrates the concept.


As a pre-condition for installing and running the SEMAINE-3.0 system, make sure that you have suitable hardware and that you have installed the Required Software.

The open source parts of the system can be downloaded from the SEMAINE sourceforge project page. You can choose between two packages, each around 900 MB. (The files are this large because they include four high-quality TTS voices as well as vocal emotion recognition models, which need a lot of space.)

  • SEMAINE-3.0-windows includes binary versions of the full SEMAINE-3.0 system: the System manager component, the dialogue components, the speech synthesizer MARY TTS, the Greta agent components, the Opensmile speech analysis components, and the message-oriented middleware ActiveMQ.
  • SEMAINE-3.0-source includes the source code for: the System manager component, the dialogue components, and the Opensmile speech analysis components, as well as binary versions of the speech synthesizer MARY TTS, the message-oriented middleware ActiveMQ and all dependencies needed to compile the code. It can be used to compile the system from source under Linux, Mac and Windows.

To run the full audio-visual SEMAINE-3.0 system, the Windows package is required. The system can run on a single fast machine (tested on a laptop with a 2.53 GHz Core2Duo CPU and 4 GB RAM), or you can set up SEMAINE-3.0 as a distributed system.

The Video analysis components are distributed as closed-source freeware: SEMAINE Visual Components. Watch out for the Camera driver requirements if you are using a Firewire camera. If installed in the default location, the start.bat script will notice that the video analysis components are installed and will try to run them. Since they are computationally heavy, you may need an additional computer to run them.

The SEMAINE-3.0 system will work without the video analysis components, but it will then pick up less information about the user.

Running the system


In its simplest form, the system can be run on a single (fast) Windows machine by installing all system components on the same computer as described above. The system is then run by starting the following batch file:


This will start ActiveMQ, wait until it is running, and then start all other installed components. If the system does not start correctly, double-check that you have unpacked both the Windows and the Java components into the same folder, and that you have met all the requirements.
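The "wait until it is started" step can be illustrated with a small sketch. This is not the logic of the actual batch file, which is not reproduced here; it only shows one common way to wait for a broker to become reachable, by polling a TCP port. The function name `wait_for_port` and its parameters are made up for this illustration.

```python
import socket
import time


def wait_for_port(host, port, timeout=30.0, interval=0.5):
    """Poll host:port until a TCP connection succeeds.

    Returns True as soon as a connection can be opened, or False if
    no connection succeeded before `timeout` seconds elapsed.
    Hypothetical helper, not part of the SEMAINE distribution.
    """
    deadline = time.monotonic() + timeout
    while True:
        try:
            # create_connection raises OSError while nothing is listening yet
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            if time.monotonic() >= deadline:
                return False
            time.sleep(interval)
```

For example, `wait_for_port("localhost", 61616)` would wait for a broker on the same machine, assuming ActiveMQ listens on its usual default port 61616; check your ActiveMQ configuration if the port was changed.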

To stop all components of the system, call


With only the Windows and Java open source components, the system runs acceptably on a 2.53 GHz Core2Duo with 4 GB RAM. With the video analysis components added, the system still runs but is very slow. It is therefore recommended to run SEMAINE-3.0 as a distributed system on several computers.

All platforms

Whereas the video input and output components are available for Windows only, there is a configuration of the system that runs on all platforms: the speech2speech system, with only speech input and speech output.

This system can be started as follows.

  • In one shell, start SEMAINE-3.0/apache-activemq-5.3.0/bin/activemq(.bat);
  • In a second shell, start the Java components as SEMAINE-3.0/bin/ (or .bat);
  • In a third shell, start Opensmile as SEMAINE-3.0/bin/run_components/start_component_tum.opensmile (Linux/Mac) or SEMAINE-3.0\Opensmile\start_openSMILE.bat (Windows).

In this configuration the Java components are started with a different config file, which loads a stack of audio-only output components in Java.
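The three start commands and their platform differences can be summarised in a short sketch. It only assembles the paths in start order and does not launch anything. The function name `speech2speech_commands` is made up for this illustration, and the name of the Java start script is not given on this page, so it is left as a parameter (`java_launcher`) that the caller must supply.

```python
from pathlib import PurePosixPath, PureWindowsPath
from typing import List


def speech2speech_commands(root: str, windows: bool, java_launcher: str) -> List[str]:
    """Return the three speech2speech start commands, in start order.

    root          -- the SEMAINE-3.0 installation folder
    windows       -- True for Windows paths, False for Linux/Mac
    java_launcher -- file name of the Java start script under bin/
                     (assumption: supplied by the caller, since the
                     name is not stated in this document)
    """
    path_cls = PureWindowsPath if windows else PurePosixPath
    base = path_cls(root)

    # 1. Message-oriented middleware: activemq or activemq.bat
    activemq_name = "activemq.bat" if windows else "activemq"
    activemq = base / "apache-activemq-5.3.0" / "bin" / activemq_name

    # 2. Java components
    java = base / "bin" / java_launcher

    # 3. Opensmile: the start script lives in a different place on Windows
    if windows:
        opensmile = base / "Opensmile" / "start_openSMILE.bat"
    else:
        opensmile = base / "bin" / "run_components" / "start_component_tum.opensmile"

    return [str(activemq), str(java), str(opensmile)]
```

Each returned command is meant to be run in its own shell, in the order given, starting with ActiveMQ.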


The SEMAINE API for Java and C++, the SEMAINE dialogue components (in Java), and the speech synthesizer MARY TTS are distributed under the GNU Lesser General Public License (LGPL), version 3. The speech synthesis voices for the SAL agents are distributed under the Creative Commons ShareAlike - No Derivatives license.

The 3D agent animation software Greta and the speech analysis software Opensmile are distributed under the GNU General Public License (GPL).

The separately installable SEMAINE Video components for camera image analysis come as a freeware binary.

Developer documentation

With some effort you can build the components from source.

Detailed documentation of the SEMAINE API is available in a number of documents:

Mailing list

There is a public SEMAINE-users mailing list. Feel free to ask questions there.

Background Information

Detailed information on the system and the underlying software architecture can be found in the following set of public project deliverable reports:

See also the slightly older set of reports on SEMAINE-2.0, which contain complementary information.

Furthermore, information about the data collected in the project can be found in the following reports: