資訊科技創新研究中心 | 近期研究成果

Yuan-Yao Shih, Ai-Chun Pang, And Pi-Cheng Hsiu

A Doppler Effect-Based Framework for Wi-Fi Signal Tracking in Search and Rescue Operations

IEEE Transactions on Vehicular Technology

May 2018

We consider rescue missions in postdisaster scenarios with particularly difficult environments where no infrastructure is available. Given the increasing popularity of smartphones and wearable devices, this paper proposes a rescue system which uses the Doppler effect to determine the direction of Wi-Fi signals emitted from disaster survivors' mobile devices to help rescuers quickly locate the survivors. First, we investigate the impact of the search and rescue environment on the direction-finding accuracy of Doppler effect to identify the major challenge and several implementation issues of the system. Then, to address the major challenge of Doppler shifts being too small, we propose an algorithm, which consists of three mechanisms, to solve the problem with the objective of maximizing the direction-finding accuracy. These mechanisms improve the direction-finding accuracy via eliminating the frequency fluctuation as much as possible and improving the sensitivity on small frequency shifts. Also, an active detection scheme is proposed to ensure that the survivors' devices emit steady and continuous Wi-Fi signals, along with a decision logic to minimize energy consumption by the active scheme. We implement the rescue system as a mobile application on Android smartphones and conduct extensive experiments in real-world environments. Results show that the proposed system can reduce rescue times by up to half while consuming reasonable amounts of energy from survivor smartphones.

Fredrick M Awour, Chih-Yu Wang, Tzu-Chieh Tsai

Motivating Content Sharing and Trustworthiness in Mobile Social Network

IEEE Access

May 2018

Mobile social networks (MSNs) enable users to discover and share contents with each other, especially at ephemeral events such as exhibitions and conferences where users could be strangers. Nevertheless, the incentive of users to actively share their contents in MSNs may be lacking if the corresponding cost is high. Besides, as users in MSN share contents in an impromptu way as they move, it makes them vulnerable to malicious users who may want to disseminate false contents. This is because users may not have knowledge about the peers they are socially connecting with in the network. In this paper, we propose MCoST, a mechanism that motivates content sharing in MSN and ensures that only trustworthy contents are shared. The mechanism is built on users' collective bidding, content cost sharing, and trust evaluation while guaranteeing individual rationality. MCoST enables content providers to share contents with multiple users simultaneously by utilizing the broadcast nature of wireless transmission. The cost of the content is collectively compensated by the content receivers through the content bidding mechanism in MCoST. In ensuring that users can establish the trustworthiness of their encounters' contents, MCoST incorporates a robust trust evaluation framework that guarantees that content reviews are immutable and tamper-proof, resistive to sybil, and rejection attacks, and that users cannot have multiple and fake identities in the network or reject negative reviews about their contents. This is achieved by integrating a distributed cryptographic hash-chained content review mechanism in the design of MCoST. Performance evaluation shows that the proposed mechanism efficiently evaluates contents' trustworthiness by detecting and discriminating review-chains under sybil or rejection attacks and reduces the time and cost to collect the desired contents by 86% and 40%, respectively, and improves network utilization by 50%.

Kuang-Jui Hsu, Yen-Yu Lin, And Yung-Yu Chuang

Co-attention CNNs for Unsupervised Object Co-segmentation

International Joint Conference on Artificial Intelligence (IJCAI)

July 2018

Object co-segmentation aims to segment the common objects in images. This paper presents a CNN-based method that is unsupervised and end-to-end trainable to better solve this task. Our method is unsupervised in the sense that it does not require any training data in the form of object masks but merely a set of images jointly covering objects of a specific class. Our method comprises two collaborative CNN modules, a feature extractor, and a co-attention map generator. The former module extracts the features of the estimated objects and backgrounds, and is derived based on the proposed co-attention loss, which minimizes inter-image object discrepancy while maximizing intra-image figure-ground separation. The latter module is learned to generate co-attention maps by which the estimated figure-ground segmentation can better fit the former module. Besides the co-attention loss, the mask loss is developed to retain the whole objects and remove noises. Experiments show that our method achieves superior results, even outperforming the state-of-the-art, supervised methods.

Chih-Kai Kang, Chun-Han Lin, Pi-Cheng Hsiu, And Ming-Syan Chen

HomeRun: HW/SW Co-Design for Program Atomicity on Self-Powered Intermittent Systems

IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)

July 2018

Self-powered intermittent systems featuring nonvolatile processors (NVPs) allow for accumulative execution in unstable power environments. However, frequent power failures may cause incorrect NVP execution results due to invalid data generated intermittently. This paper presents a HW/SW co-design, called HomeRun, to guarantee atomicity by ensuring that an uninterruptible program section can be run through at one execution. We design a HW module to ensure that a power pulse is sufficient for an atomic section, and develop a SW mechanism for programmers to protect atomic sections. The proposed design is validated through the development of a prototype pattern locking system. Experimental results demonstrate that the proposed design can completely guarantee atomicity and significantly improve the energy utilization of self-powered intermittent systems.

Han-Yi Lin, Chia-Chun Hung, Pi-Cheng Hsiu, And Tei-Wei Kuo

Duet: An OLED and GPU Co-management Scheme for Dynamic Resolution Adaptation

IEEE/ACM Design Automation Conference (DAC)

June 2018

The increasingly high display resolution of mobile devices imposes a further burden on energy consumption. Existing schemes manage either OLED or GPU power to save energy. This paper presents the design, algorithm, and implementation of a co-managing scheme called Duet, which automatically trades off perceptual quality for energy efficiency in accordance with static and dynamic visual acuity when users interact with mobile applications. The results of experiments conducted on a commercial smartphone with popular interactive apps show that Duet saves more energy while retaining better visual quality, compared with a joint scheme that simultaneously uses dynamic pixel disabling and dynamic resolution scaling to save OLED and GPU energy in isolation.

Gao Zheng, Chih-Yu Wang, Vasilis Friderikos, Mischa Dohler

High Mobility Multi Modal E-Health Services

IEEE International Conference on Communications (ICC)

May 2018

In emergency medical services, the lag time between injury and treatment is one of the most critical parameters with respect to patient survivability. Ambulance services aim to maximize the likelihood of prompt medical treatment to prevent death and/or potential non-reversible damages. The emerging Tactile Internet has a vital role to play on that frontier by allowing next generation of ambulances to be equipped with advanced haptic/tactile devices to allow pre-hospital treatment/diagnosis or even remote surgery while en route. In this paper we propose a novel reliable multi-modal e-health high mobility service optimization framework for ambulances utilizing mobile edge clouds to efficiently transport real time patient information to the hospital. The main challenge of the proposed e-health service is to guarantee the heterogeneous QoS requirements of all involved data flows between the ambulance and the medical personnel. To this end, we formulate the service configuration problem as an optimization problem. In addition, a set of low-complexity algorithms are proposed to provide competitive solutions in real-time. A comprehensive set of numerical investigations are presented to characterize the attainable system performance of the proposed schemes.

Chih-Yu Wang, Yan Chen, K.J. Ray Liu

Game-Theoretic Cross Social Media Analytic: How Yelp Ratings Affect Deal Selection on Groupon?

IEEE Transactions on Knowledge and Data Engineering

May 2018

Deal selection on Groupon is a typical social learning and decision making process, where the quality of a deal is usually unknown to the customers. The customers must acquire this knowledge through social learning from other social medias such as reviews on Yelp. Additionally, the quality of a deal depends on both the state of the vendor and decisions of other customers on Groupon. How social learning and network externality affect the decisions of customers in deal selection on Groupon is our main interest. We develop a data-driven game-theoretic framework to understand the rational deal selection behaviors cross social medias. The sufficient condition of the Nash equilibrium is identified. A value-iteration algorithm is proposed to find the optimal deal selection strategy. We conduct a year-long experiment to trace the competitions among deals on Groupon and the corresponding Yelp ratings. We utilize the dataset to analyze the deal selection game with realistic settings. Finally, the performance of the proposed social learning framework is evaluated with real data. The results suggest that customers do make decisions in a rational way instead of following naive strategies, and there is still room to improve their decisions with assistance from the proposed framework.

Hao-Wen Dong, Wen-Yi Hsiao, Li-Chia Yang, Yi-Hsuan Yang

MuseGAN: Multi-track sequential generative adversarial networks for symbolic music generation and accompaniment

AAAI Conference on Artificial Intelligence

February 2018

Generating music has a few notable differences from generating images and videos. First, music is an art of time, necessitating a temporal model. Second, music is usually composed of multiple instruments/tracks with their own temporal dynamics, but collectively they unfold over time interdependently. Lastly, musical notes are often grouped into chords, arpeggios or melodies in polyphonic music, and thereby introducing a chronological ordering of notes is not naturally suitable. In this paper, we propose three models for symbolic multi-track music generation under the framework of generative adversarial networks (GANs). The three models, which differ in the underlying assumptions and accordingly the network architectures, are referred to as the jamming model, the composer model and the hybrid model. We trained the proposed models on a dataset of over one hundred thousand bars of rock music and applied them to generate piano-rolls of five tracks: bass, drums, guitar, piano and strings. A few intratrack and inter-track objective metrics are also proposed to evaluate the generative results, in addition to a subjective user study. We show that our models can generate coherent music of four bars right from scratch (i.e. without human inputs). We also extend our models to human-AI cooperative music generation: given a specific track composed by human, we can generate four additional tracks to accompany it. All code, the dataset and the rendered audio samples are available at https://salu133445.github.io/musegan/.

Y.-S. Huang, S.-Y. Chou And Y.-H. Yang

Generating music medleys via playing music puzzle games

AAAI Conference on Artificial Intelligence

February 2018

Generating music medleys is about finding an optimal permutation of a given set of music clips. Toward this goal, we propose a self-supervised learning task, called the music puzzle game, to train neural network models to learn the sequential patterns in music. In essence, such a game requires machines to correctly sort a few multisecond music fragments. In the training stage, we learn the model by sampling multiple nonoverlapping fragment pairs from the same songs and seeking to predict whether a given pair is consecutive and is in the correct chronological order. For testing, we design a number of puzzle games with different difficulty levels, the most difficult one being music medley, which requiring sorting fragments from different songs. On the basis of state-of-the-art Siamese convolutional network, we propose an improved architecture that learns to embed frame-level similarity scores computed from the input fragment pairs to a common space, where fragment pairs in the correct order can be more easily identified. Our result shows that the resulting model, dubbed as the similarity embedding network (SEN), performs better than competing models across different games, including music jigsaw puzzle, music sequencing, and music medley. Example results can be found at our project website, https://remyhuang.github.io/DJnet.

Li-Chia Yang, Szu-Yu Chou, Yi-Hsuan Yang,

MidiNet: A convolutional generative adversarial network for symbolic-domain music generation

Proc. Int. Society for Music Information Retrieval Conf. (ISMIR)

October 2017

Most existing neural network models for music generation use recurrent neural networks. However, the recent WaveNet model proposed by DeepMind shows that convolutional neural networks (CNNs) can also generate realistic musical waveforms in the audio domain. Following this light, we investigate using CNNs for generating melody (a series of MIDI notes) one bar after another in the symbolic domain. In addition to the generator, we use a discriminator to learn the distributions of melodies, making it a generative adversarial network (GAN). Moreover, we propose a novel conditional mechanism to exploit available prior knowledge, so that the model can generate melodies either from scratch, by following a chord sequence, or by conditioning on the melody of previous bars (e.g. a priming melody), among other possibilities. The resulting model, named MidiNet, can be expanded to generate music with multiple MIDI channels (i.e. tracks). We conduct a user study to compare the melody of eight-bar long generated by MidiNet and by Google’s MelodyRNN models, each time using the same priming melody. Result shows that MidiNet performs comparably with MelodyRNN models in being realistic and pleasant to listen to, yet MidiNet’s melodies are reported to be much more interesting.