JP2014143582

Patent Translate
Powered by EPO and Google
Notice
This translation is machine-generated. It cannot be guaranteed that it is intelligible, accurate,
complete, reliable or fit for specific purposes. Critical decisions, such as commercially relevant or
financial decisions, should not be based on machine-translation output.
DESCRIPTION JP2014143582
Abstract: PROBLEM TO BE SOLVED: To reduce sound leakage to the surroundings during a call. The
above problem is solved by a communication device having a voice input unit that collects
non-audible murmur, including the body-conducted sound of murmuring without vibration of the vocal
cords; a transmission unit that transmits the voice signal obtained by the voice input unit; a
reception unit that receives a voice signal from an external device; and a bone conduction output
unit that outputs the voice signal obtained by the reception unit by bone conduction.
[Selected figure] Figure 2
Intercom
[0001]
BACKGROUND OF THE INVENTION Field of the Invention The present invention relates to a
communication device, and more particularly to a communication device that reduces sound leakage
to the surroundings during a call.
[0002]
Conventionally, when a program is recorded in a studio of a broadcasting station, an
intercommunication-type local telephone such as an intercom (inter-communication) is used for
calls between staff members (see, for example, Patent Document 1).
Since the intercom can be worn on the head of the user like headphones, a voice call can be made
with the hands free.
01-05-2019
[0003]
Japanese Patent Application Laid-Open No. 10-262162
[0004]
However, when an intercom or the like is used, the user must speak aloud as in normal
conversation, so the voice spreads to the surroundings.
As a result, in a recording studio, the voices of staff members who are talking are picked up by
the sound collecting microphones for broadcasting. Further, the intercom is worn so as to directly
cover one or both ears of the user so that the sound from the speaker does not leak to the outside.
Therefore, during a call, the user cannot hear surrounding sounds (in particular, from the
direction of the covered ear) and cannot grasp the situation of the surroundings. The essence of
these problems lies in the acts of emitting a voice into the air and of listening to sound in the
air.
[0005]
An object of the present invention is to provide a communication device that reduces sound
leakage to the surroundings during a call.
[0006]
In order to solve the above problems, the present invention adopts means for solving the
problems having the following features.
[0007]
The communication device according to one aspect includes a voice input unit that collects
non-audible murmur, including the body-conducted sound of murmuring without vibration of the vocal
cords; a transmission unit that transmits the voice signal obtained by the voice input unit; a
reception unit that receives a voice signal from an external device; and a bone conduction output
unit that outputs the voice signal obtained by the reception unit by bone conduction.
[0008]
According to the present invention, it is possible to reduce sound leakage to the surroundings
during a call.
[0009]
FIG. 1 is a diagram showing an example of a schematic configuration of a call system.
FIG. 2 is a diagram showing an example of a functional configuration of the communication device.
FIG. 3 is a diagram showing a schematic example of voice input by the NAM microphone.
FIG. 4 is a diagram for describing a specific example of processing in the signal processing unit.
FIG. 5 is a diagram (No. 1) showing a specific example and a mounting example of the communication device.
FIG. 6 is a diagram (No. 2) showing a specific example and a mounting example of the communication device.
FIG. 7 is a diagram (No. 3) showing a specific example and a mounting example of the communication device.
FIG. 8 is a diagram (No. 4) showing a specific example and a mounting example of the communication device.
FIG. 9 is a diagram (No. 5) showing a specific example and a mounting example of the communication device.
FIG. 10 is a diagram showing another specific example of the communication device.
[0010]
<Regarding the Present Invention> The communication device in the present invention realizes a
silent voice interface. A silent voice interface is a way of talking without leaking voice to the
surroundings, enabling communication only between the speaker and the listener, like telepathy,
for example. Accordingly, a mechanism is provided by which the listener hears a natural voice even
when, for example, the speaker speaks so as not to be heard by those nearby.
[0011]
Specifically, on the voice transmission side of the communication device, a body-conducted voice
technology such as non-audible murmur (NAM) is used as an example of the silent voice interface.
Normally, when a person speaks, the voice is based on the sound generated by vibrating the vocal
cords. Non-audible murmur is, for example, the act of only moving the mouth while exhaling, without
vibrating the vocal cords; it includes the body-conducted sound of murmuring that cannot be heard
by a third party and does not involve vocal cord vibration. The microphone that picks up this sound
conducted through the body is the NAM microphone (hereinafter referred to as "NAM microphone").
[0012]
Because non-audible murmur uses vibration inside the body rather than air as the medium, it can be
expected to convey speech for communication without leaking sound to the outside. Similar to bone
conduction, this is "meat conduction", in which the flesh of the human body, rather than bone,
serves as the voice transmission medium.
[0013]
That is, in the present invention, the NAM microphone is used to pick up the user's very quiet
whispered utterance, and the voice is recorded as body-conducted voice. As a result, an utterance
can be made with as little sound leakage to the surroundings as possible, and an utterance can be
made even if the mouth is covered with a highly sound-insulating mask or the like. Therefore, in
the present invention, a mask covering part or all of the user's mouth may be used in combination.
[0014]
Further, in the present invention, body-conducted voice enhancement processing based on statistical
signal processing may be applied as needed to the voice recorded by the NAM microphone in order to
improve voice quality such as clarity and naturalness.
[0015]
Furthermore, on the voice receiving side of the communication device, the user listens through a
bone conduction speaker as an example of the silent voice interface.
This makes it possible to listen to the call voice without blocking the ears, so surrounding
sounds can be heard at the same time. As a result, a call in which sound does not easily leak to
the surroundings can be realized without vibrating the air on either the sound emitting side or
the sound receiving side. Hereinafter, a preferred embodiment of the communication device
will be described in detail with reference to the drawings.
[0016]
<Example of Schematic Configuration of Call System> FIG. 1 is a diagram showing an example of
a schematic configuration of a call system. FIG. 1 shows a call system 10 used in a broadcast
station as an example. The call system 10 has a call apparatus 11 and a sub adjustment room 12.
The call apparatus 11 and the sub adjustment room 12 are connected, in a state in which data can be
transmitted and received, by a wireless or wired communication network 13 such as a LAN (Local
Area Network) or the Internet.
[0017]
The call device 11 is a device used by staff members in a studio 14 where a program or the like
is recorded, such as a cameraman or assistant director (AD) shooting the program, a person in
charge of audio, or a person in charge of lighting. It functions as an intercom for making calls
with producers, other staff in the studio 14, and the like.
[0018]
In the example of the call system 10 illustrated in FIG. 1, three call devices 11-1 to 11-3 are
illustrated, but the number is not limited to this; it suffices to have at least one call device 11.
Further, the call system 10 manages which user is using which call apparatus 11, and a call can be
made to one or a plurality of set users.
[0019]
Here, the communication device 11 illustrated in FIG. 1 includes the NAM microphone 21, the
bone conduction speaker 22, the band unit 23, and the protection unit 24. As described above,
the NAM microphone 21 picks up the user's very quiet whispered utterance (e.g., non-audible
murmur) or the like. The NAM microphone 21 is highly resistant to noise propagating in the air.
[0020]
The bone conduction speaker 22 outputs vibration based on the voice signal received via the
communication network 13 to the user's body (bone), and transmits the sound directly to the
auditory nerve, for example, through the skull. Various bone conduction methods can be used in
this embodiment, such as a piezoelectric element method, a super magnetostrictive element method,
and a coil element method, but the method is not limited to these.
By including the NAM microphone 21 and the bone conduction speaker 22, it is possible to
reduce the sound leakage to the surroundings during a call.
[0021]
The band unit 23 holds the main body of the communication device 11 between the head and the
neck of the user. The band portion 23 is formed in, for example, a semicircular shape as shown in
FIG. 1, and the semicircular portion has a predetermined elasticity. The semicircular portion can
be held in a predetermined position by fitting it on the head or neck of the user. At this time, it is
preferable that the above-described NAM microphone 21 and the bone conduction speaker 22 be
held in contact with the user, but the present invention is not limited to this.
[0022]
The band unit 23 has an adjustment mechanism that can adjust the size and diameter of the
above-mentioned semicircle, the positions of the NAM microphone 21 and the protection unit 24
attached to the band unit 23, and the like according to the size of the head and neck of the user.
Since the communication device 11 is detachably worn by means of the band unit 23, the user's hands
remain free.
[0023]
In addition, the communication device 11 shown in FIG. 1 may have a protection unit 24 having
a predetermined elasticity so that the user is not hurt when the communication device 11 is held on
the head or neck. As the protection unit 24, for example, an elastic material such as rubber,
sponge, or soft resin, or a soft material such as cloth or cotton can be used, but it is not
limited to these; for example, it may have a smooth curved surface in which the portion in contact
with the skin is chamfered. This makes the contact with the skin soft.
[0024]
Furthermore, the communication device 11 in the present embodiment may be held by sandwiching the
head between the NAM microphone 21 and the protection unit 24 described above, or may be held by an
ear hook hung on the ear.
[0025]
Among the operation rooms for operating the equipment provided in the various studios 14 of a
broadcasting station such as a television or radio station, the sub adjustment room 12 is connected
by a broadcast line to the "main adjustment room" that transmits the broadcast, and is an operation
room for operating program production equipment to adjust audio, video, and the like.
[0026]
In the sub adjustment room 12, the images sent from each camera in the studio 14 and the sound
sent from each microphone are monitored.
In addition, the sub adjustment room 12 receives reports, as voice signals via the call devices 11,
from staff members in the studio 14 such as the floor director, each cameraman, the AD, the person
in charge of audio, and the person in charge of lighting, and judges the status of the studio 14.
[0027]
From the sub adjustment room 12, a voice signal such as an instruction from the producer or the
like is output to the communication device 11 used by a predetermined user in the studio 14.
Since the sub adjustment room 12 is outside the studio 14, a normal microphone or speaker may be
used there, for example, instead of the communication device 11 of the present embodiment.
[0028]
In addition, in the call system 10 shown in FIG. 1, the call device 11 can not only make a call with
the sub adjustment room 12 but also make a call between the call devices 11. In this case, it is
possible to make a call only to the predetermined communication device 11 by switching the
frequency (channel) like a transceiver, for example.
[0029]
<The example of functional composition of speech device 11> Next, the example of functional
composition of the communication device 11 mentioned above is explained with reference to the
drawings. In the following description, components having substantially the same functions as
those described above are denoted by the same reference numerals, and their specific description
is omitted. FIG. 2 is a diagram showing an example of a functional configuration of the
communication device. The communication device 11 shown in FIG. 2 includes a NAM microphone 21 as
an example of a voice input unit, a bone conduction speaker 22 as an example of a bone conduction
output unit, a signal processing unit 31, a transmission unit 32, a reception unit 33, and a signal
conversion unit 34.
[0030]
As described above, the NAM microphone 21 picks up the user's inaudible sound and outputs the
resulting audio signal to the signal processing unit 31. In the present embodiment, the signal
obtained from the NAM microphone 21 may be amplified to a predetermined level using an amplifier
(amplifying unit) or the like.
[0031]
The signal processing unit 31 performs signal processing on the audio signal obtained from the
NAM microphone 21 to increase the recognition rate on the listener side. The specific processing
in the signal processing unit 31 will be described later. The signal processed by the signal
processing unit 31 is output to the transmission unit 32. In the present embodiment, the processing
in the signal processing unit 31 need not always be performed on the audio signal obtained from the
NAM microphone 21. For example, the user may set in advance whether or not the processing by the
signal processing unit 31 is performed, or the processing may be performed when the recognition
rate is estimated to be low from the level and frequency characteristics of the sound input by the
NAM microphone 21.
[0032]
The transmission unit 32 transmits the audio signal obtained from the signal processing unit 31,
or directly from the NAM microphone 21, to a predetermined transmission destination (for example,
the sub adjustment room 12 or another call device 11). The transmission unit 32 may transmit the
audio signal to one or more preset transmission destinations. Alternatively, a selection unit for
selecting the transmission destination may be provided, and the transmission unit 32 may transmit
the audio signal to the transmission destination selected by the selection unit.
[0033]
The reception unit 33 receives an audio signal from a predetermined source (for example, the sub
adjustment room 12 or another call device 11). The reception unit 33 may receive an audio signal
from a preset source. Alternatively, a selection unit for selecting the source may be provided, and
the reception unit 33 may receive the audio signal from the source selected by the selection unit.
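
The destination selection described for the transmission unit 32 can be sketched roughly as
follows. This is only an illustration under stated assumptions, not the implementation of the
invention: the class name TransmissionUnit, its methods, and the destination labels are
hypothetical, and the sketch returns a dictionary where a real device would send the signal over
the communication network 13.

```python
class TransmissionUnit:
    """Toy model of a transmission unit with an optional destination selection."""

    def __init__(self, preset_destinations):
        # Destinations configured in advance (hypothetical labels).
        self.preset_destinations = list(preset_destinations)
        # None means "no selection made": fall back to all preset destinations.
        self.selected = None

    def select(self, destinations):
        """Play the role of the selection unit: restrict where audio is sent."""
        unknown = [d for d in destinations if d not in self.preset_destinations]
        if unknown:
            raise ValueError(f"unknown destinations: {unknown}")
        self.selected = list(destinations)

    def transmit(self, audio_signal):
        """Return a mapping destination -> signal instead of doing network I/O."""
        if self.selected is not None:
            targets = self.selected
        else:
            targets = self.preset_destinations
        return {dest: audio_signal for dest in targets}


tx = TransmissionUnit(["sub_adjustment_room_12", "call_device_11-2", "call_device_11-3"])
broadcast = tx.transmit(b"\x01\x02")  # no selection: every preset destination
tx.select(["call_device_11-2"])
private = tx.transmit(b"\x01\x02")    # after selection: only the chosen device
```

The reception side could mirror this with a set of accepted sources, which is one way to read the
transceiver-like channel switching mentioned for calls between call devices 11.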
[0034]
The signal conversion unit 34 converts the voice signal received by the reception unit 33 into a
vibration signal to be transmitted to the bone of the user. The conversion is preferably matched
to, for example, the bone conduction method used in the bone conduction speaker 22, but is not
limited to this.
[0035]
<Regarding the NAM Microphone 21> Here, for the above-described NAM microphone 21, for
example, the methods described in Japanese Patent No. 3760173 and Japanese Patent No.
4702975 can be used. FIG. 3 is a diagram showing a schematic example of voice input by the
NAM microphone.
[0036]
The communication device 11 in the present embodiment is worn so that the neck or head is
sandwiched between the NAM microphone 21 and the protection unit 24 at both ends of the band unit
23, as shown in the figure, and picks up a signal. The NAM microphone 21 is, for example, a
microphone (a meat conduction microphone) that picks up, from outside the body, the vibration sound
(breath sound) that is produced without regular vibration of the vocal cords, is inaudible to
others, and is conducted through the soft tissue inside the body (meat conduction).
[0037]
Specifically, the NAM microphone 21 has a soft silicon portion 51, a vibration sensor 52, an
electrode 53, and a sound insulation cover 54. The soft silicon portion 51 is a soft member (for
example, a silicon member or the like) in contact with the skin of the user 40; it receives, by
meat conduction, the vibration generated as air vibration in the vocal tract of the user 40 and
passes it to the vibration sensor 52. The vocal tract is, for example, as shown in FIG. 3, the
airway portion downstream of the vocal cords in the direction of exhalation (the portion from the
vocal cords to the lips, including the oral cavity and the nasal cavity).
[0038]
The vibration sensor 52 contacts the soft silicon portion 51 and converts the vibration of the soft
silicon portion 51 into an electrical signal. The electrical signal obtained by the vibration sensor
52 is output to the outside through the electrode 53.
[0039]
The sound insulation cover 54 is a soundproofing material that prevents vibration transmitted
through the surrounding air other than the skin in contact with the soft silicon portion 51 from
being transmitted to the soft silicon portion 51 or the vibration sensor 52.
[0040]
The NAM microphone 21 is worn, for example, so as to contact the skin surface over the
sternocleidomastoid muscle, just below the mastoid process of the skull, at the lower part of the
pinna of the user 40, as shown in the figure.
As a result, the vibration generated in the vocal tract (that is, the vibration of non-audible
murmur) propagates to the soft silicon portion 51 by nearly the shortest path through a part of
the speaker's body where there is no bone (the flesh). The mounting position of the NAM
microphone 21 is not limited to this, and may be near the vocal tract or near the vocal cords, for
example.
[0041]
<The specific example of the process in the signal processing part 31> Next, the specific example
of the process in the signal processing part 31 mentioned above is described with reference to the
drawings. FIG. 4 is a diagram for describing a specific example of processing in the signal
processing unit. In this embodiment, speech enhancement processing using a conversion model based
on statistical signal processing is performed in order to improve the quality of real-time
body-conducted speech such as non-audible murmur so that it can be easily recognized on the
listener side.
[0042]
Specifically, as shown in FIG. 4, a plurality of (for example, 50 or more) preset sentences are
each recorded both as meat-conducted voice using the NAM microphone 21 and as normal voice (natural
voice emitted from the mouth), and a conversion model is generated by learning from the resulting
speech data of meat-conducted voice and natural voice with the same content. More specifically, for
each of the meat-conducted voice and the natural voice, spectral feature quantities are extracted
in segments delimited by time frames of about several tens of milliseconds, and the correspondence
between the extracted spectral feature quantities is learned to generate a conversion model for
converting meat-conducted voice into natural voice.
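
The learning step described above (frame-wise spectral features from paired recordings, then a
learned correspondence) can be illustrated with a deliberately simplified sketch. Everything here
is an assumption made for illustration: the per-bin linear regression is a stand-in and is not
the statistical conversion model of the embodiment, the two signals are assumed to be already
time-aligned, the frame length and the toy 1 kHz sampling rate are hypothetical, and the
"body-conducted" signal is just an attenuated copy of the "natural" one.

```python
import cmath
import math
import random

FRAME = 32  # samples per frame; 32 ms at an assumed toy sampling rate of 1 kHz


def log_spectrum(frame):
    """Log-magnitude spectrum of one frame via a naive DFT (bins 0..N/2)."""
    n = len(frame)
    bins = []
    for k in range(n // 2 + 1):
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        bins.append(math.log(abs(acc) + 1e-9))
    return bins


def frames_to_features(signal):
    """Split a signal into non-overlapping frames and extract features per frame."""
    return [log_spectrum(signal[i:i + FRAME])
            for i in range(0, len(signal) - FRAME + 1, FRAME)]


def fit_per_bin_linear(src_feats, dst_feats):
    """Least-squares fit of dst = a * src + b, independently for each spectral bin."""
    model = []
    for k in range(len(src_feats[0])):
        xs = [f[k] for f in src_feats]
        ys = [f[k] for f in dst_feats]
        mx = sum(xs) / len(xs)
        my = sum(ys) / len(ys)
        var = sum((x - mx) ** 2 for x in xs)
        cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
        a = cov / var if var > 0 else 1.0
        model.append((a, my - a * mx))
    return model


def convert(model, feats):
    """Apply the learned per-bin mapping to a sequence of feature frames."""
    return [[a * v + b for (a, b), v in zip(model, frame)] for frame in feats]


# Toy paired data: 50 frames of "natural" speech and an attenuated "body-conducted" copy.
random.seed(0)
natural = [random.uniform(-1.0, 1.0) for _ in range(FRAME * 50)]
body_conducted = [0.1 * s for s in natural]

src_feats = frames_to_features(body_conducted)
dst_feats = frames_to_features(natural)
model = fit_per_bin_linear(src_feats, dst_feats)
converted = convert(model, src_feats)
max_err = max(abs(c - d)
              for cf, df in zip(converted, dst_feats)
              for c, d in zip(cf, df))
```

In this toy setting the mapping reduces to a constant offset in the log-spectral domain, so the
fit recovers the natural features almost exactly. Real body-conducted speech differs from natural
speech in far more complex ways, which is why a statistical model trained on many sentences is
used in the embodiment.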
[0043]
The signal processing unit 31 can convert meat-conducted speech into natural speech by applying
the above-described conversion model to the speech obtained from the NAM microphone 21. In this
way, the processing in the signal processing unit 31 can improve the speech recognition rate on
the listener side (voice reception side).
[0044]
As a method of generating a conversion model, for example, the method described in Japanese
Patent No. 4940414 can be used, but the method is not limited to this, and other methods may
be used. In the method of Japanese Patent No. 4940414, a predetermined feature quantity is
calculated for each of the learning input signal of inaudible speech and the learning output signal
of audible whisper speech corresponding to the learning input signal recorded by the microphone.
Based on the calculation results, the model parameters of a vocal tract feature quantity conversion
model, which converts the feature quantity of an inaudible voice signal into the feature quantity
of an audible whisper voice signal, are learned, and the learned model parameters are stored in
predetermined storage means. Next, the feature quantity is calculated for an input inaudible voice
signal; the feature quantity of the audible whisper voice signal corresponding to the input
inaudible voice signal is calculated from that result and the vocal tract feature quantity
conversion model in which the learned model parameters are set; and an audible whisper voice
signal corresponding to the input inaudible voice signal is generated.
[0045]
Further, in the present embodiment, the processing by the signal processing unit 31 is performed
only when it is estimated that the recognition rate is low from, for example, the size and
frequency characteristics of the sound input by the NAM microphone 21 or when the user sets it.
Otherwise, the processing by the signal processing unit 31 may not be performed.
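
The decision of whether to run the processing of the signal processing unit 31 can be sketched as
a small gating function. The description does not specify how the recognition rate is estimated,
so the measures used here are assumptions chosen for illustration: RMS level stands in for the
"size" of the sound, the energy of the first difference relative to the signal energy stands in
for its frequency characteristics, and the function names and both thresholds are hypothetical.

```python
import math


def rms(samples):
    """Root-mean-square level of the input sound."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def high_freq_ratio(samples):
    """Energy of the first difference relative to signal energy (crude HF measure)."""
    diff_energy = sum((b - a) ** 2 for a, b in zip(samples, samples[1:]))
    energy = sum(s * s for s in samples)
    return diff_energy / energy if energy > 0 else 0.0


def should_enhance(samples, user_setting=None, min_rms=0.05, min_hf_ratio=0.1):
    """Return True if the enhancement processing should be applied.

    An explicit user setting (True/False) takes priority; otherwise the input is
    treated as likely hard to recognize when it is quiet or lacks high frequencies.
    """
    if user_setting is not None:
        return user_setting
    return rms(samples) < min_rms or high_freq_ratio(samples) < min_hf_ratio


loud_bright = [0.5 if t % 2 == 0 else -0.5 for t in range(100)]
quiet = [0.01 * math.sin(2 * math.pi * t / 100) for t in range(100)]
loud_dull = [0.5 * math.sin(2 * math.pi * t / 100) for t in range(100)]

decisions = [should_enhance(x) for x in (loud_bright, quiet, loud_dull)]
```

Only the loud signal with strong high-frequency content skips the processing in this sketch; a
quiet signal or one dominated by low frequencies is routed through it, and the user setting can
override either outcome.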
[0046]
<Specific Example and Mounting Example of Communication Device 11> Next, a specific example
and mounting example of the communication device 11 will be described.
FIGS. 5 to 9 are diagrams (Nos. 1 to 5) showing specific examples and mounting examples of the
communication device. FIG. 5(A) to FIG. 9(A) show specific examples of the call devices 11a to 11e,
and FIG. 5(B) to FIG. 9(B) show mounting examples of the call devices 11a to 11e.
[0047]
In the communication device 11a shown in the example of FIG. 5(A), the bone conduction speaker
22 is formed as an earphone 61 and is connected by a cable 62 to the other components of the
communication device 11a (for example, the NAM microphone 21). As shown in FIG. 5(B), the
communication device 11a is worn with the band unit 23 held on the neck of the user 40, and the
NAM microphone 21 and the protection unit 24 near the two ends of the band unit 23 sandwich a
predetermined site of the user 40 (for example, near the vocal cords or below the left and right
pinnae) so that the NAM microphone 21 is in close contact with that site. Worn in this way, the
NAM microphone 21 picks up mainly the sound propagating through the soft tissue of the human body,
so even when the ambient noise is loud, the human body functions as a noise removal filter and an
audio signal with a high SN ratio can be obtained. In the example of FIG. 5, since the bone
conduction speaker 22 has an earphone shape, surrounding sounds are easier to hear than with a
headphone shape that covers the entire ear.
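
As a worked illustration of the SN ratio statement above, the following sketch computes the SN
ratio in decibels as 10 log10(signal power / noise power) for two cases. The numbers are purely
hypothetical: the body is assumed to attenuate the amplitude of airborne noise by a factor of 10
while the speech level is assumed unchanged, neither of which is stated in the description.

```python
import math


def mean_power(samples):
    """Mean squared amplitude of a sampled signal."""
    return sum(s * s for s in samples) / len(samples)


def snr_db(signal, noise):
    """SN ratio in decibels: 10 * log10(signal power / noise power)."""
    return 10.0 * math.log10(mean_power(signal) / mean_power(noise))


speech = [math.sin(2 * math.pi * t / 20) for t in range(200)]
ambient_noise = [0.5 * math.sin(2 * math.pi * t / 7) for t in range(200)]
# Hypothetical: airborne noise reaches the sensor at one tenth of its amplitude.
noise_at_nam_mic = [0.1 * n for n in ambient_noise]

improvement_db = snr_db(speech, noise_at_nam_mic) - snr_db(speech, ambient_noise)
```

Under these assumptions a tenfold reduction in noise amplitude is a hundredfold reduction in noise
power, so the SN ratio improves by 20 dB.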
[0048]
Further, in the example of FIG. 6(A), compared with the communication device 11a shown in FIG. 5
described above, an ear hooking portion 63 is provided on the bone conduction speaker 22 of the
communication device 11b. With such a configuration, as shown in FIG. 6(B), the communication
device 11b is worn by placing the ear hooking portion 63 on the ear of the user 40 so that the bone
conduction speaker 22 is in contact with the area in front of the ear. As a result, neither ear is
completely blocked, and surrounding sounds can be properly grasped while talking.
[0049]
In addition, the structure of the bone conduction speaker 22 that leaves both ears unblocked is
not limited to the structure having the ear hook part 63 shown in FIGS. 6(A) and 6(B) described
above. For example, an attachable adhesive member, suction cup, or the like may be provided on the
surface of the bone conduction speaker 22 that contacts the skin of the user 40, and the speaker
may be attached so as to be in close contact with the area below the ear or near the temple of the
user 40.
[0050]
Further, in the example of FIG. 7(A), ear hooks 63 can be provided at both ends of the band 23
of the communication device 11c, and a bone conduction speaker 22 can be provided in each ear
hook 63.
In this case, since the communication device 11c can be fixed by the band portion 23 and the ear
hook portions 63, the protection portion 24 can be made unnecessary. Although in the communication
device 11c shown in FIG. 7 the bone conduction speaker 22 is provided in the ear hook portions 63
at both ends, it is not limited to this, and it may be provided in only one of them.
[0051]
As a result, as shown in FIG. 7(B), the head or neck need not be clamped tightly between the NAM
microphone 21 and the protection unit 24 when the device is worn, which reduces the load on the
user 40.
[0052]
Further, in the example of FIG. 8(A), a detachable mask 64 is provided at the end of the band 23 of
the communication device 11d.
The mask 64 has a mask main body 64a and an ear hooking portion 64b. The mask main body 64a
covers a part or all of the mouth of the user 40 when worn, as shown in FIG. 8(B). The
mask main body 64a may be, for example, cloth-like or sheet-like. The mask main body 64a may
have removable gauze or the like on part or all of the surface in contact with the mouth. Further,
the ear hooking portion 64b is an elastic body such as rubber, but is not limited to this.
[0053]
In this embodiment, since it becomes possible to make a call without opening the mouth widely
and making a voice, it becomes possible to make a call even if the mask 64 is attached, and it is
possible to minimize the sound leakage to the surroundings.
[0054]
In the example of FIG. 9A, the bone conduction speaker 22 is provided in the ear hooking portion
64b of the communication device 11e.
The bone conduction speaker 22 shown in FIG. 9A can be detachably attached to any position of
the ear hook portion 64b.
[0055]
When worn as shown in FIG. 9(B), the bone conduction speaker 22 can be brought into close contact
with a predetermined position on the user 40 by the ear hooking portion 64b, which is an elastic
body such as rubber. Therefore, the voice signal (vibration) can be transmitted to the user 40 with
high accuracy. In the case of the example shown in FIG. 9, a cable or the like can be run from the
main body of the communication device 11e through the inside of the mask 64 to connect to other
components (for example, the reception unit 33) and send a signal to the bone conduction speaker
22.
[0056]
In the example shown in FIG. 9, the bone conduction speaker 22 is provided on the mask 64.
However, the present invention is not limited to this. For example, the NAM microphone 21 may be
provided on the mask 64, or both the NAM microphone 21 and the bone conduction speaker 22 may be
provided on the mask 64. Furthermore, when both the NAM microphone 21 and the bone conduction
speaker 22 are provided on the mask 64, a configuration without the band portion 23 is possible.
[0057]
In the examples shown in FIG. 5 to FIG. 9 described above, a neck band type held on the neck by
the band portion 23 is shown, but the present invention is not limited to this; it may be a
headphone type configuration. In this embodiment, the NAM microphone 21 collects sound most
efficiently at the lower rear part of the ear, and the bone conduction speaker 22 transmits more
efficiently if provided near the tympanic membrane or near the skull. Therefore, when the
communication device 11 is used, a neck band type or the like is preferable.
[0058]
Further, the above-described NAM microphone 21 is not limited to the meat conduction
microphone, and may be, for example, a bone conduction microphone or a throat microphone. In
addition, the above-described NAM microphone 21 and bone conduction speaker 22 may be
provided for the left and right ears, or a plurality of them may be provided for one of the ears.
[0059]
Further, the communication device 11 in the present embodiment may be configured by combining
parts of the configurations of the communication devices described above. FIG. 10 is a diagram
showing another specific example of the communication device. Compared with the communication
device 11a shown in FIG. 5(A), the communication device 11f shown in FIG. 10 is configured such
that the bone conduction speaker 22 is detachable.
[0060]
In the example of FIG. 10, a plug (male terminal) 65 is provided at the end of the cable 62 that
comes out of the earphone 61 of the bone conduction speaker 22, and a jack (female terminal)
66 corresponding to the plug 65 is provided in the NAM microphone 21. In the communication device
11f, inserting the plug 65 into the jack 66 establishes an electrical connection with the other
components so that signals can be transmitted and received.
[0061]
With the configuration of the communication device 11f shown in FIG. 10, for example, the
earphone type bone conduction speaker 22 shown in FIG. 10 can be replaced with an ear hook type
bone conduction speaker 22 as described above, or with a normal earphone that does not use bone
conduction. Furthermore, by inserting nothing into the jack 66, the communication device 11f can
be used with only the function of the NAM microphone 21. Although the jack 66 is provided in the
NAM microphone 21 in the example of FIG. 10, the present invention is not limited to this. For
example, the jack 66 may be provided in the band unit 23 or the protection unit 24.
[0062]
Further, in the present embodiment, when the ambient noise is loud, one or both of the NAM
microphone 21 and the bone conduction speaker 22 described above may be embedded in ear muffs or
headphones large enough to cover the ears of the user 40 in order to block the ambient sound.
[0063]
As described above, according to the present invention, it is possible to reduce sound leakage to
the surroundings during a call.
Specifically, on the voice transmitting side of the communication device, for example, by
recording the body-conducted sound using a NAM microphone or the like, communication can be carried
out with a very quiet whispered voice. As a result, the volume of the uttered voice itself can be
dramatically reduced compared to normal speech. Furthermore, according to the present embodiment,
sound leakage to the surroundings can be reduced as much as possible by using a mask that covers
part or all of the user's mouth.
[0064]
Also, by using statistical speech enhancement processing, the clarity and naturalness of the
speech recorded by the NAM microphone can be enhanced, and a smoother call can be made.
[0065]
Further, on the voice receiving side of the communication device, for example, by using a bone
conduction speaker, it is possible to listen to the sent voice without blocking both ears, so it is
possible to hear surrounding sounds even while listening.
[0066]
Note that the communication device in the above-described embodiment is not limited to the
inside of a studio, and can be applied to various scenes of the broadcasting industry such as live
broadcasting and on-site reports, for example.
Further, the present invention can be applied not only to the broadcast industry but also to, for
example, a situation where a call in a crowd or a call with high confidentiality is required.
[0067]
Although the preferred embodiments of the present invention have been described above in
detail, the present invention is not limited to the specific embodiments, and various modifications
and changes may be made within the scope of the present invention as set forth in the claims.
[0068]
DESCRIPTION OF REFERENCE NUMERALS
10 call system
11 call apparatus
12 sub adjustment room
13 communication network
14 studio
21 NAM microphone (voice input unit)
22 bone conduction speaker (bone conduction output unit)
23 band unit
24 protection unit
31 signal processing unit
32 transmission unit
33 reception unit
34 signal conversion unit
40 user
51 soft silicon portion
52 vibration sensor
53 electrode
54 sound insulation cover
61 earphone
62 cable
63 ear hook
64 mask
65 plug (male terminal)
66 jack (female terminal)