as it uses server-based computing through network
with high speed and bandwidth. It can likewise
realize the low cost of robot by reduced computing
power and to increase availability through
providing unlimited services. Therefore, as ASR and
TTS engine using many resources of robot execute
in the URC system to serve user speech recognition
and synthesis, it is possible to support unlimited lists
of recognition words and speech synthesis for many
languages.
Figure 2 shows the block diagram of URC Main
server which is in charge of an interface between
URC Robot and ASR/TTS engine servers. ASR/TTS
engine can be constructed a separate server or not.
3 PERFORMANCE EVALUATION
OF URC SYSTEM WITH ASR,
TTS SERVER
This section describes a test environment, scenario
and results for evaluating performance about server-
based ASR/TTS system. The URC server system for
ubiquitous robot satisfies requirements for real-time
as follows:
(1) [Requirement 1] Average response time less than
1 second for request message of clients and/or robots.
(2) [Requirement 2] Providing sessions (clients +
robots) more than minimum 100 per 1 server.
Above-mentioned conditions are minimum
requirements to actually apply URC system to fields
(home).
3.1 Experimental Environment and
Scenario
An experimental scenario divides 2 cases according
to the location of ASR and TTS engine servers.
(1) [Scenario 1] The ASR/TTS engine servers are
located in local machine with the URC Main
software.
(2) [Scenario 2] The ASR/TTS engine servers are
separated to external server from machine with the
URC Main software through network.
Figure 3 and 4 show the structure of the URC
system model for [Scenario 1 and 2].
In the figure 3 and 4, URC Man server, ASR/TTS
servers and virtual robots have IP addresses in the
same subnet. We regarded 1 transaction time until
receiving a corresponding response message after
the virtual robot sends a request message to the URC
Main server as RTT (Round-Trip Time). The
[Requirement 1] means that it satisfies average RTT
≤ 1.
A hardware specification of URC Main server, ASR
and TTS server, operating system and software
specification for each [Scenario] is listed in Table 1.
Table 1: Specification of Servers (URC Main, ASR, TTS).
URC Main
Server
ASR
Server
TTS
Server
Hardware
specification
CPU : Intel® Xeon™ Processor 3.2
GHz/1M, EM64T, 800MHz FSB * 2EA
Memory : 4GB, DDR-2 400MHz ECC
HDD : 146GB Ultra320 SCSI
LAN : 100Mbps
Operating
System
Redhat Enterprise Linux AS(kernel 2.4.21-
4.ELsmp)
etc 9 ASR engine : HCILab[5] ASR software
(Korean version, Independent Speaker,
10,000 words support)
9 TTS engine : HCILab[5] TTS software
(Korean version)
We constructed a test scenario to verify whether
URC system model for [Scenario 1 and 2] satisfies
[Requirement 1 and 2] or not. Figure 5 shows a
sequential diagram for test scenarios. However, all
case of [Scenario 1] except “URC Main server”,
“ASR server” and “TTS server” are operated in 1
machine is identical.
1. Robots over 100 are connected to the URC Main
server. The robot to be established the connection is
completed an authentication step from the URC
Main server.
2. Each robot receives a speech input, “What is the
URC?” from user and transmits it to the URC Main
server after the robot converts to WAV file.
3. The URC Main server receives the WAV file and
transfers to an ASR server to request speech
recognition.
4. The ASR server sends a text (string), symbol and
score to the URC Main server as result after he
processes speech recognition.
5. The URC Main server decides a response string
to have to transmit with the recognized result.
6. The URC Main server transfers the response
string to TTS server to convert speech output, which
is WAV file, “I’m going to tell you about the URC.”
7. The URC Main server is obtained a response
WAV file by TTS server and then transfers the
robot.
A STUDY ON ASR/TTS SERVER ARCHITECTURE FOR NETWORK ROBOT SYSTEM
279