Gao Zheng, Chih-Yu Wang, Vasilis Friderikos, Mischa Dohler
High Mobility Multi Modal E-Health Services
IEEE International Conference on Communications (ICC)
May 2018
In emergency medical services, the lag time between injury and treatment is one of the most critical parameters with respect to patient survivability. Ambulance services aim to maximize the likelihood of prompt medical treatment to prevent death and/or potential non-reversible damages. The emerging Tactile Internet has a vital role to play on that frontier by allowing next generation of ambulances to be equipped with advanced haptic/tactile devices to allow pre-hospital treatment/diagnosis or even remote surgery while en route. In this paper we propose a novel reliable multi-modal e-health high mobility service optimization framework for ambulances utilizing mobile edge clouds to efficiently transport real time patient information to the hospital. The main challenge of the proposed e-health service is to guarantee the heterogeneous QoS requirements of all involved data flows between the ambulance and the medical personnel. To this end, we formulate the service configuration problem as an optimization problem. In addition, a set of low-complexity algorithms are proposed to provide competitive solutions in real-time. A comprehensive set of numerical investigations are presented to characterize the attainable system performance of the proposed schemes.
Chih-Yu Wang, Yan Chen, K.J. Ray Liu
Game-Theoretic Cross Social Media Analytic: How Yelp Ratings Affect Deal Selection on Groupon?
IEEE Transactions on Knowledge and Data Engineering
May 2018
Deal selection on Groupon is a typical social learning and decision making process, where the quality of a deal is usually unknown to the customers. The customers must acquire this knowledge through social learning from other social medias such as reviews on Yelp. Additionally, the quality of a deal depends on both the state of the vendor and decisions of other customers on Groupon. How social learning and network externality affect the decisions of customers in deal selection on Groupon is our main interest. We develop a data-driven game-theoretic framework to understand the rational deal selection behaviors cross social medias. The sufficient condition of the Nash equilibrium is identified. A value-iteration algorithm is proposed to find the optimal deal selection strategy. We conduct a year-long experiment to trace the competitions among deals on Groupon and the corresponding Yelp ratings. We utilize the dataset to analyze the deal selection game with realistic settings. Finally, the performance of the proposed social learning framework is evaluated with real data. The results suggest that customers do make decisions in a rational way instead of following naive strategies, and there is still room to improve their decisions with assistance from the proposed framework.
Y.-S. Huang, S.-Y. Chou And Y.-H. Yang
Generating music medleys via playing music puzzle games
AAAI Conference on Artificial Intelligence
February 2018
Generating music medleys is about finding an optimal permutation of a given set of music clips. Toward this goal, we propose a self-supervised learning task, called the music puzzle game, to train neural network models to learn the sequential patterns in music. In essence, such a game requires machines to correctly sort a few multisecond music fragments. In the training stage, we learn the model by sampling multiple nonoverlapping fragment pairs from the same songs and seeking to predict whether a given pair is consecutive and is in the correct chronological order. For testing, we design a number of puzzle games with different difficulty levels, the most difficult one being music medley, which requiring sorting fragments from different songs. On the basis of state-of-the-art Siamese convolutional network, we propose an improved architecture that learns to embed frame-level similarity scores computed from the input fragment pairs to a common space, where fragment pairs in the correct order can be more easily identified. Our result shows that the resulting model, dubbed as the similarity embedding network (SEN), performs better than competing models across different games, including music jigsaw puzzle, music sequencing, and music medley. Example results can be found at our project website, https://remyhuang.github.io/DJnet.
Hao-Wen Dong, Wen-Yi Hsiao, Li-Chia Yang, Yi-Hsuan Yang
MuseGAN: Multi-track sequential generative adversarial networks for symbolic music generation and accompaniment
AAAI Conference on Artificial Intelligence
February 2018
Generating music has a few notable differences from generating images and videos. First, music is an art of time, necessitating a temporal model. Second, music is usually composed of multiple instruments/tracks with their own temporal dynamics, but collectively they unfold over time interdependently. Lastly, musical notes are often grouped into chords, arpeggios or melodies in polyphonic music, and thereby introducing a chronological ordering of notes is not naturally suitable. In this paper, we propose three models for symbolic multi-track music generation under the framework of generative adversarial networks (GANs). The three models, which differ in the underlying assumptions and accordingly the network architectures, are referred to as the jamming model, the composer model and the hybrid model. We trained the proposed models on a dataset of over one hundred thousand bars of rock music and applied them to generate piano-rolls of five tracks: bass, drums, guitar, piano and strings. A few intratrack and inter-track objective metrics are also proposed to evaluate the generative results, in addition to a subjective user study. We show that our models can generate coherent music of four bars right from scratch (i.e. without human inputs). We also extend our models to human-AI cooperative music generation: given a specific track composed by human, we can generate four additional tracks to accompany it. All code, the dataset and the rendered audio samples are available at https://salu133445.github.io/musegan/.
Li-Chia Yang, Szu-Yu Chou, Yi-Hsuan Yang,
MidiNet: A convolutional generative adversarial network for symbolic-domain music generation
Proc. Int. Society for Music Information Retrieval Conf. (ISMIR)
October 2017
Most existing neural network models for music generation use recurrent neural networks. However, the recent WaveNet model proposed by DeepMind shows that convolutional neural networks (CNNs) can also generate realistic musical waveforms in the audio domain. Following this light, we investigate using CNNs for generating melody (a series of MIDI notes) one bar after another in the symbolic domain. In addition to the generator, we use a discriminator to learn the distributions of melodies, making it a generative adversarial network (GAN). Moreover, we propose a novel conditional mechanism to exploit available prior knowledge, so that the model can generate melodies either from scratch, by following a chord sequence, or by conditioning on the melody of previous bars (e.g. a priming melody), among other possibilities. The resulting model, named MidiNet, can be expanded to generate music with multiple MIDI channels (i.e. tracks). We conduct a user study to compare the melody of eight-bar long generated by MidiNet and by Google’s MelodyRNN models, each time using the same priming melody. Result shows that MidiNet performs comparably with MelodyRNN models in being realistic and pleasant to listen to, yet MidiNet’s melodies are reported to be much more interesting.
D. K. Verma, R. Y. Chang, And F.-T. Chien
Energy-Assisted Decode-and-Forward for Energy Harvesting Cooperative Cognitive Networks
IEEE Transactions on Cognitive Communications and Networking
September 2017
In this paper, we consider a simultaneous wireless information and power transfer (SWIPT)-enabled cooperative cognitive network that addresses energy scarcity and spectral scarcity, two important issues in 5G wireless communications. In the considered network, the self-sustainable, SWIPT-enabled relay assists primary user's transmission, while the relay itself is also a secondary user with its own information superimposed on the regenerated primary information for transmission. The SWIPT relay employs the proposed energy-assisted decode-and-forward (EDF) protocol, which enhances the conventional decode-and-forward (DF) protocol with energy-dimension-augmented information decoding. We conduct a comparative analysis of the proposed EDF and the conventional DF and amplify-and-forward (AF) protocols in this SWIPT cooperative cognitive framework in terms of capacity, outage probability, and throughput for both primary and secondary networks. Simulation corroborates the analysis and demonstrates performance advantages of EDF over DF/AF from various perspectives.
Chih-Yu Wang, Yan Chen, K.J. Ray Liu
Hidden Chinese Restaurant Game: Grand Information Extraction for Stochastic Network Learning
IEEE Transactions on Signal and Information Processing over Networks
June 2017
Agents in networks often encounter circumstances requiring them to make decisions. Nevertheless, the effectiveness of the decisions may be uncertain due to the unknown system state and the uncontrollable externality. The uncertainty can be eliminated through learning from information sources, such as user-generated contents or revealed actions. Nevertheless, the user-generated contents could be untrustworthy since other agents may maliciously create misleading contents for their selfish interests. The passively revealed actions are potentially more trustworthy and also easier to be gathered through simple observations. In this paper, we propose a new stochastic game-theoretic framework, Hidden Chinese Restaurant Game (H-CRG), to utilize the passively revealed actions in stochastic social learning process. We propose grand information extraction, a novel Bayesian belief extraction process, to extract the belief on the hidden information directly from the observed actions. We utilize the coupling relation between belief and policy to transform the original continuous belief-state Markov decision process (MDP) into a discrete-state MDP. The optimal policy is then analyzed in both centralized and game-theoretic approaches. We demonstrate how the proposed H-CRG can be applied to the channel access problem in cognitive radio networks. We then conduct data-driven simulations using the CRAWDAD Dartmouth campus wireless local area network (WLAN) trace. The simulation results show that the equilibrium strategy derived in H-CRG provides higher expected utilities for new users and maintains a reasonable high social welfare comparing with other candidate strategies.
A. Chern, Y.-H. Lai, Y. Chang, Y. Tsao, R. Y. Chang, And H.-W. Chang
A Smartphone-Based Multi-Functional Hearing Assistive System to Facilitate Speech Recognition in the Classroom
IEEE Access
June 2017
In this paper, we propose a smartphone-based hearing assistive system (termed SmartHear) to facilitate speech recognition for various target users who could benefit from enhanced listening clarity in the classroom. The SmartHear system consists of transmitter and receiver devices (e.g., smartphone and Bluetooth headset) for voice transmission, and an Android mobile application that controls and connects the different devices via Bluetooth or WiFi technology. The wireless transmission of voice signals between devices overcomes the reverberation and ambient noise effects in the classroom. The main functionalities of SmartHear include: 1) configurable transmitter/receiver assignment, to allow flexible designation of transmitter/receiver roles; 2) advanced noise-reduction techniques; 3) audio recording; and 4) voice-to-text conversion, to give students visual text aid. All the functions are implemented as a mobile application with an easy-to-navigate user interface. Experiments show the effectiveness of the noise-reduction schemes at low signal-to-noise ratios (SNR) in terms of standard speech perception and quality indices, and show the effectiveness of SmartHear in maintaining voice-to-text conversion accuracy regardless of the distance between the speaker and listener. Future applications of SmartHear are also discussed.
Tsun-Yi Yang, Jo-Han Hsu, Yen-Yu Lin, And Yung-Yu Chuang
DeepCD: Learning Deep Complementary Descriptors for Patch Representations
IEEE International Conference on Computer Vision (ICCV), Poster Session
October 2017
This paper presents the DeepCD framework which learns a pair of complementary descriptors jointly for image patch representation by employing deep learning techniques. It can be achieved by taking any descriptor learning architecture for learning a leading descriptor and augmenting the architecture with an additional network stream for learning a complementary descriptor. To enforce the complementary property, a new network layer, called data-dependent modulation (DDM) layer, is introduced for adaptively learning the augmented network stream with the emphasis on the training data that are not well handled by the leading stream. By optimizing the proposed joint loss function with late fusion, the obtained descriptors are complementary to each other and their fusion improves performance. Experiments on several problems and datasets show that the proposed method is simple yet effective, outperforming state-of-the-art methods
Han-Yi Lin, Pi-Cheng Hsiu, And Tei-Wei Kuo
ShiftMask: Dynamic OLED Power Shifting Based on Visual Acuity for Interactive Mobile Applications
IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)
July 2017
OLED power management on mobile devices is very challenging due to the dynamic nature of human-screen interaction. This paper presents the design, algorithms, and implementation of a lightweight mobile app called ShiftMask, which allows the user to dynamically shift OLED power to the portion of interest, while dimming the remainder of the screen based on visual acuity. To adapt to the user’s focus of attention, we propose efficient algorithms that consider visual fixation in static scenes, as well as changes in focus and screen scrolling. The results of experiments conducted on a commercial smartphone with popular interactive apps demonstrate that ShiftMask can achieve substantial energy savings, while preserving acceptable readability.