This paper proposes an item concept embedding (ICE) framework to model item concepts via textual information. Specifically, the proposed framework comprises two stages: graph construction and embedding learning. In the first stage, we propose a generalized network construction method to build a network involving heterogeneous nodes and a mixture of both homogeneous and heterogeneous relations. The second stage leverages the concept of neighborhood proximity to learn the embeddings of both items and words. With the carefully designed ICE networks, the resulting embeddings facilitate both homogeneous and heterogeneous retrieval, including item-to-item and word-to-item retrieval. Moreover, as a distributed embedding approach, ICE not only generates related retrieval results but also delivers more diverse results than traditional keyword-matching-based approaches. As our experiments on two real-world datasets show, ICE encodes useful textual information and thus outperforms traditional methods in various item classification and retrieval tasks.
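The first-stage graph construction described above can be sketched as follows. This is a toy illustration only: the function name, the prefix convention for node types, and the similarity-threshold rule for word-word edges are assumptions for exposition, not the paper's exact procedure.

```python
from collections import defaultdict

def build_ice_network(item_words, word_similarity, sim_threshold=0.5):
    """Build a toy heterogeneous graph over item and word nodes.

    Heterogeneous edges: item <-> word, from each item's textual description.
    Homogeneous edges:   word <-> word, kept when a similarity score is strong.
    Node names are prefixed so the two node types stay distinct.
    """
    edges = defaultdict(set)
    for item, words in item_words.items():
        for w in words:
            edges["item:" + item].add("word:" + w)   # heterogeneous relation
            edges["word:" + w].add("item:" + item)
    for (w1, w2), sim in word_similarity.items():
        if sim >= sim_threshold:                     # homogeneous relation
            edges["word:" + w1].add("word:" + w2)
            edges["word:" + w2].add("word:" + w1)
    return dict(edges)
```

An embedding method based on neighborhood proximity would then be run over this mixed graph so that items land near the words describing them.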
This paper addresses three issues in integrating part-based representations into convolutional neural networks (CNNs) for object recognition. First, most part-based models rely on a few pre-specified object parts, yet the optimal object parts for recognition often vary from category to category. Second, acquiring training data with part-level annotation is labor-intensive. Third, modeling spatial relationships between parts in CNNs often involves an exhaustive search of part templates over multiple network streams. We tackle these three issues by introducing a new network layer, called the co-occurrence layer. It extends a convolutional layer to encode the co-occurrence between the visual parts detected by its numerous neurons, instead of a few pre-specified parts. To this end, the feature maps serve as both filters and images, and mutual correlation filtering is conducted between them. The co-occurrence layer is end-to-end trainable. The resultant co-occurrence features are rotation- and translation-invariant, and are robust to object deformation. By applying this new layer to VGG-16 and ResNet-152, we achieve recognition rates of 83.6% and 85.8% on the Caltech-UCSD bird benchmark, respectively. The source code is available at https://github.com/yafangshih/Deep-COOC.
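The idea of correlating feature maps against each other can be sketched in a few lines of numpy. This is a minimal, non-differentiable stand-in (circular shifts over a small window, max-pooled), not the layer's actual implementation; see the linked repository for that.

```python
import numpy as np

def cooccurrence_features(fmaps, max_shift=1):
    """Toy co-occurrence features for a (channels, H, W) activation tensor.

    For each pair of feature maps, take the maximum correlation over small
    circular spatial shifts, so two part detectors that fire together are
    captured regardless of their exact relative position.
    """
    c, _, _ = fmaps.shape
    feats = np.zeros((c, c))
    shifts = range(-max_shift, max_shift + 1)
    for i in range(c):
        for j in range(c):
            best = -np.inf
            for dy in shifts:
                for dx in shifts:
                    shifted = np.roll(np.roll(fmaps[j], dy, axis=0), dx, axis=1)
                    best = max(best, float(np.sum(fmaps[i] * shifted)))
            feats[i, j] = best  # strength of co-occurrence of parts i and j
    return feats
```

With circular shifts and a symmetric shift window, the resulting matrix is symmetric, matching the intuition that co-occurrence of parts i and j is an unordered relation.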
Singing voice separation, a fundamental problem in music information retrieval, attempts to separate the vocal and instrumental parts of a music recording. Recent work on singing voice separation has shown that the low-rank representation and informed separation approaches are both able to improve separation quality. However, low-rank optimizations are computationally inefficient due to the use of singular value decompositions. Therefore, in this paper, we propose a new linear-time algorithm called informed group-sparse representation, and use it to separate the vocals from music using pitch annotations as side information. Experimental results on the iKala dataset confirm the efficacy of our approach, suggesting that the music accompaniment follows a group-sparse structure given a pretrained instrumental dictionary. We also show how our work can be easily extended to accommodate multiple dictionaries using the DSD100 dataset.
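The underlying decomposition — accompaniment modeled as a group-sparse combination of instrumental dictionary atoms, vocals as the residual — can be sketched with a generic proximal-gradient solver. This is a toy sketch under simplifying assumptions (one group per atom, plain ISTA iterations), not the paper's linear-time algorithm, and it ignores the pitch-annotation side information.

```python
import numpy as np

def group_soft_threshold(A, groups, lam):
    """Proximal operator of the l2,1 penalty: shrink each group of
    coefficient rows toward zero by its joint l2 norm."""
    A = A.copy()
    for g in groups:
        norm = np.linalg.norm(A[g])
        A[g] *= max(0.0, 1.0 - lam / norm) if norm > 0 else 0.0
    return A

def separate(X, D, lam=0.1, n_iter=500):
    """Toy group-sparse separation: accompaniment ~ D @ A with group-sparse
    A; vocals are the residual X - D @ A.  X is a magnitude spectrogram,
    D a pretrained instrumental dictionary."""
    A = np.zeros((D.shape[1], X.shape[1]))
    step = 1.0 / np.linalg.norm(D, 2) ** 2          # 1 / Lipschitz constant
    groups = [[k] for k in range(D.shape[1])]       # toy choice: singleton groups
    for _ in range(n_iter):
        grad = D.T @ (D @ A - X)                    # gradient of 0.5||X - DA||^2
        A = group_soft_threshold(A - step * grad, groups, lam * step)
    accomp = D @ A
    return accomp, X - accomp
```

In an informed variant, the groups would instead be chosen from the side information (e.g., atoms grouped by annotated pitch), which is what gives the representation its group structure.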
This paper considers a device-to-device (D2D) communications underlaid multiple-input multiple-output (MIMO) cellular network and studies D2D mode selection from a previously unexamined perspective. Since D2D mode selection affects the network interference profile and vice versa, joint D2D mode selection and interference management is desired but challenging. In this work, we propose a holistic approach to this problem with interference-free considerations. We adopt the degrees of freedom (DoF) as the mode-selection criterion and exploit the linear interference alignment (IA) technique for interference management. We analyze the achievable sum DoF of the potential D2D users according to their mode selections, and derive the probabilistic sum-rate relations between the proposed DoF-based mode selection scheme and the common received-signal-strength-index (RSSI)-based mode selection scheme in Poisson point process (PPP) networks. Simulations illustrate the theoretical insights and show the advantages of the proposed DoF-based mode selection scheme over conventional mode selection schemes from various perspectives. The proposed scheme is thus a promising candidate for D2D mode selection in 5G communications.
Multi-label classification is a practical yet challenging task in machine learning related fields, since it requires the prediction of more than one label category for each input instance. We propose a novel deep neural network (DNN) based model, Canonical Correlated AutoEncoder (C2AE), for solving this task. Aiming at better relating feature and label domain data for improved classification, we uniquely perform joint feature and label embedding by deriving a deep latent space, followed by the introduction of a label-correlation sensitive loss function for recovering the predicted label outputs. Our C2AE is achieved by integrating the DNN architectures of canonical correlation analysis and autoencoder, which allows end-to-end learning and prediction with the ability to exploit label dependency. Moreover, our C2AE can be easily extended to address the learning problem with missing labels. Our experiments on multiple datasets with different scales confirm the effectiveness and robustness of our proposed method, which is shown to perform favorably against state-of-the-art methods for multi-label classification.
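The core idea — embedding features and labels into a shared latent space where they are maximally correlated — can be illustrated with classical linear CCA. This is a linear stand-in for C2AE's deep latent space, written from the standard CCA eigenproblem; all names and the regularization constant are illustrative assumptions.

```python
import numpy as np

def joint_embedding(X, Y, dim=2, reg=1e-3):
    """Toy CCA-style joint embedding: find projections Wx, Wy so that
    X @ Wx and Y @ Wy (features vs. label vectors) are maximally
    correlated in a shared latent space."""
    X = X - X.mean(0)
    Y = Y - Y.mean(0)
    Cxx = X.T @ X + reg * np.eye(X.shape[1])   # regularized covariances
    Cyy = Y.T @ Y + reg * np.eye(Y.shape[1])
    Cxy = X.T @ Y
    # Standard CCA eigenproblem: Cxx^{-1} Cxy Cyy^{-1} Cyx wx = rho^2 wx
    M = np.linalg.solve(Cxx, Cxy) @ np.linalg.solve(Cyy, Cxy.T)
    vals, vecs = np.linalg.eig(M)
    order = np.argsort(-vals.real)[:dim]
    Wx = vecs[:, order].real
    Wy = np.linalg.solve(Cyy, Cxy.T) @ Wx      # matching label-side projection
    return Wx, Wy
```

C2AE replaces these linear maps with deep encoders and adds a decoder with a label-correlation-aware loss; the linear version above only conveys the correlation-maximizing objective.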
Unsupervised domain adaptation deals with scenarios in which labeled data are available in the source domain, but only unlabeled data can be observed in the target domain. Since the classifiers trained on source-domain data would not be expected to generalize well in the target domain, how to transfer the label information from source- to target-domain data is a challenging task. A common technique for unsupervised domain adaptation is to match cross-domain data distributions, so that the domain and distribution differences can be suppressed. In this paper, we propose to utilize the label information inferred from the source domain, while the structural information of the unlabeled target-domain data is jointly exploited for adaptation purposes. Our proposed model not only reduces the distribution mismatch between domains but also simultaneously improves recognition of target-domain data. In the experiments, we show that our approach performs favorably against state-of-the-art unsupervised domain adaptation methods on benchmark data sets. We also provide convergence, sensitivity, and robustness analyses, which support the use of our model for cross-domain classification.
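The distribution-matching step common to such methods is typically measured with a statistic like the maximum mean discrepancy (MMD). The sketch below computes the standard (biased) squared MMD with an RBF kernel; it illustrates the mismatch being suppressed, not this paper's specific model.

```python
import numpy as np

def rbf_mmd2(Xs, Xt, gamma=1.0):
    """Biased squared maximum mean discrepancy with an RBF kernel: a
    standard measure of the distribution mismatch between a source
    sample Xs and a target sample Xt (rows are data points)."""
    def k(A, B):
        # pairwise squared distances, then Gaussian kernel
        d = np.sum(A**2, 1)[:, None] + np.sum(B**2, 1)[None, :] - 2 * A @ B.T
        return np.exp(-gamma * d)
    return k(Xs, Xs).mean() + k(Xt, Xt).mean() - 2 * k(Xs, Xt).mean()
```

An adaptation method would add a term like this to its objective so that the learned representation drives the cross-domain discrepancy toward zero while the source labels supervise the classifier.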
Textural style transfer aims to transfer the textural style identified from a reference image to a source image, while retaining the scene of the source image. This article proposes a context-aware style transfer algorithm based on sparse-representation-based textural synthesis. Whereas sparse representation is designed to extract the style component of the reference image, textural synthesis is performed in a context-aware setting to preserve the original scene structure of the source image. Unlike existing solutions that require prior knowledge of the textural style of interest or user interaction, this method performs the transfer automatically. Experimental results demonstrate the effectiveness of the proposed method for automatic style transfer from a single style template image that is not accompanied by its original real image.