Situating Wearables: Smartwatch Use in Context

by Jinyoung Choi on 2018-08-06 11:00:58

Date : 2018. 08. 08 (Wed) 13:00 Locate : EB5. 533 Presenter : Jinyoung Choi   Title : Situating Wearables: Smartwatch Use in Context Author : Donald McMillan, Barry Brown, Airi Lampinen, Moira McGregor, Eve Hogga, Stefania Pizza (The University of Stockholm at Kista, Sweden)   Abstract : Drawing on 168 hours of video recordings of smartwatch use, this paper studies how context influences smartwatch use. We explore the effects of the presence of others, activity, location and time of day on 1,009 instances of use. Watch interaction is significantly shorter when in conversation than when alone. Activity also influences watch use with significantly longer use while eating than when socialising or performing domestic tasks. One surprising finding is that length of use is similar at home and work. We note that usage peaks around lunchtime, with an average of 5.3 watch uses per hour throughout a day. We supplement these findings with ... Continue reading →


CLBlast: A Tunes OpenCL BLAS Library

by Byungkyo Jung on 2018-07-20 17:28:45

Date : 2018.7.25 Locate : EB5. 533 Presenter : Byeongkyo Cheong Author : Cedric Nugteren Title : CLBlast: A Tuned OpenCL BLAS Library Abstract : This work introduces CLBlast, an open-source BLAS library providing optimized OpenCL routines to accelerate dense linear algebra for a wide variety of devices. It is targeted at machine learning and HPC applications and thus provides a fast matrix-multiplication routine (GEMM) to accelerate the core of many applications (e.g. deep learning, iterative solvers, astrophysics, computational fluid dynamics, quantum chemistry). CLBlast has five main advantages over other OpenCL BLAS libraries: 1) it is optimized for and tested on a large variety of OpenCL devices including less commonly used devices such as embedded and low-power GPUs, 2) it can be explicitly tuned for specific problem-sizes on specific hardware platforms, 3) it can perform operations in half-precision floating-point FP16 saving bandwidth, time and energy, ... Continue reading →


네트워크 성능향상을 위한 시스템 호출 수준 코어 친화도

by Daeyoung Song on 2018-07-16 10:27:16

Data : 2018.7.18 (Wed) 13:00 Locate : EB5. 533 Presenter : Daeyoung Song Title : 네트워크 성능향상을 위한 시스템 호출 수준 코어 친화도 Author : 엄준용, 조중연, 진현욱 Abstract : Existing operating systems experience scalability issues as the number of cores increases. The network I/O performance on manycore systems is faced with the major limiting factors of cache consistency costs and locking overheads. Legacy methods resolve this issue include the new microkernel-like operating system or modification of existing kernels; however, these solutions are not fully application transparent. In this study, we proposed a library that improves the network performance by separating system call context from user context and by applying the core affinity without any kernel and application modifications. Experiment results showed that our implementation can improve the network throughput of Apache by up to 30%. Continue reading →


멀티코어 환경에서 비실시간 메시지의 응답시간 지연을 최소화하는 리눅스 기반 메시지 처리기의 설계 및 구현

by Hyeoksoo Jang on 2018-06-27 15:08:03

Date : 2018. 07. 011 (Wed) 13:00 Locate : EB5. 533 Presenter : Hyeoksoo Jang Title : 멀티코어 환경에서 비실시간 메시지의 응답시간 지연을 최소화하는 리눅스 기반 메시지 처리기의 설계 및 구현 Author : 왕상호, 박영훈, 박성용, 김승춘, 김철회, 김상준, 진 철 Link : Continue reading →


by Hyeoksoo Jang on 2018-06-27 14:57:42

Continue reading →


S3DNN: Supervised Streaming and Scheduling for GPU-Accelerated Real-Time DNN Workloads

by Jinse Kwon on 2018-06-15 11:55:01

Date : 2018. 06. 20 (Wed) 13:00 Locate : EB5. 533 Presenter : Jinse Kwon   Title : S3DNN: Supervised Streaming and Scheduling for GPU-Accelerated Real-Time DNN Workloads Author : Husheng Zhou,  Soroush Bateni,  Cong Liu (The University of Texas at Dallas)   Abstract : Deep Neural Networks (DNNs) are being widely applied in many advanced embedded systems that require autonomous decision making, e.g., autonomous driving and robotics. To handle resource-demanding DNN workloads, graphic processing units (GPUs) have been used as the main acceleration engine. Although much research has been conducted to algorithmically optimize the efficiency of applying DNN to applications such as object recognition, limited attention has been given to optimizing the execution of GPU accelerated DNN workloads at the system level. In this paper, we propose S3DNN, a system solution that optimizes the execution of DNN workloads on GPU in a ... Continue reading →


Rehearsal: Reducing Convolutional Neural Network for Object Detection on Embedded Devices

by Do Trung Hai on 2018-05-29 15:09:39

Date : 2018. 05. 30 (Wed) 13:00 Locate : EB5. 533 Presenter : Trunghai Do Abstract :  Nowadays, convolutional neural networks (CNNs) become the center of many computer vision solutions to solve a variety of tasks. However, memory and computation are two of the most important characteristics of deep neural networks. These characteristics make neural networks difficult to effectively deploy on limited hardware resources such as embedded systems. Furthermore, to deploy models for devices and update them regularly, the model size needs to be small. In this thesis, we propose DroidDet a small fully convolutional neural network. Our DroidDet adopts You Only Look Once (YOLO) object detection algorithm for the ARM Mali-T628 MP6 GPU of ODROID-XU4. In order to build DroidDet, we not only replace all the fully connected layers that act as detection layers in YOLO with convolutional layers but also rearrange some of the convolutional layers to reduce the model size and ... Continue reading →


매니코어 환경에서 리눅스 커널의 공유 메모리 관리에 대한 문제점 분석

by Byungkyo Jung on 2018-05-17 20:49:05

Data : 2018.5.23 (Wed) 13:00 Locate : EB5. 533 Presenter : Byeongkyo Cheong Title : 매니코어 환경에서 리눅스 커널의 공유 메모리 관리에 대한 문제점 분석 Author : 서동주, 경주현, 임성수 Continue reading →


HeartChat: Heart Rate Augmented Mobile Messaging to Support Empathy and Awareness

by Jinyoung Choi on 2018-05-11 16:24:57

Date : 2018. 5. 16 (Wed) 13:00 Locate : EB5. 533 Presenter : Jinyoung Choi   Title : HeartChat: Heart Rate Augmented Mobile Messaging to Support Empathy and Awareness Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems May 06-11, 2017 ISBN : 978-1-4503-4655-9 doi : 10.1145/3025453.3025758   Author : Mariam Hassib, Daniel Buschek, Paweł W. Wo´zniak, Florian Alt, LMU Munich-Ubiquitous Interactive Systems Group, University of Stuttgart - VIS, Stuttgart, Germany Abstract : Textual communication via mobile phones suffers from a lack of context and emotional awareness. We present a mobile chat application, HeartChat, which integrates heart rate as a cue to increase awareness and empathy. Through a literature review and a focus group, we identified design dimensions important for heart rate augmented chats. We created three concepts showing heart rate per message, in real-time, or sending it explicitly. ... Continue reading →


Sparsification and Separation of Deep Learning Layers for Constrained Resource Inference ...

by Jinse Kwon on 2018-03-30 16:04:38

Date : 2018. 04. 04 (Wed) 13:00 Locate : EB5. 533 Presenter : Jinse Kwon   Title : Sparsification and Separation of Deep Learning Layers for Constrained Resource Inference on Wearables Author : Sourav Bhattacharya, Nicholas D. Lane (Nokia Bell Labs and University College London)   Abstract : Deep learning has revolutionized the way sensor data are analyzed and interpreted. The accuracy gains these approaches offer make them attractive for the next generation of mobile, wearable and embedded sensory applications. However, state-of-the-art deep learning algorithms typically require a significant amount of device and processor resources, even just for the inference stages that are used to discriminate high-level classes from low-level data. The limited availability of memory, computation, and energy on mobile and embedded platforms thus pose a significant challenge to the adoption of these powerful learning techniques. In this paper, we ... Continue reading →


리눅스 기반 모바일 기기에서 사용자 응답성 향상을 위한 프레임워크 지원 선별적 페이지 보호 기법

by Byungkyo Jung on 2018-03-14 15:48:47

Data : 2018.3.21 (Wed) 13:00 Locate : EB5. 533 Presenter : Byeongkyo Cheong Author : 김승준, 김정호, 홍성수 Abstract : While Linux-based mobile devices such as smartphones are increasingly used, they often exhibit poor response time. One of the factors that influence the user-perceived interactivity is the high page fault rate of interactive tasks. Pages owned by interactive tasks can be removed from the main memory due to the memory contention between interactive and background tasks. Since this increases the page fault rate of the interactive tasks, their executions tend to suffer from increased delays. This paper proposes a framework-assisted selective page protection mechanism for improving interactivity of Linux-based mobile devices. The framework-assisted selective page protection enables the run-time system to identify interactive tasks at the framework level and to deliver their IDs to the kernel. As a result, the kernel can maintain the pages owned by the identified ... Continue reading →


Predicting Latent Narrative Mood Using Audio and Physiologic Data

by Jinyoung Choi on 2018-03-11 11:05:37

Date : 2018. 3. 13 (Wed) 13:00 Locate : EB5. 533 Presenter : Jinyoung Choi   Title : Predicting Latent Narrative Mood Using Audio and Physiologic Data AAAI 2017, 948-954 Author : Tuka Waddah AlHanai, Mohammad MAhdi Ghassemi, Massachusetts Institute of Techonology, Cambridge MA, USA Abstract : Inferring the latent emotive content of a narrative requires consideration of para-linguistic cues (e.g. pitch), linguistic content (e.g. vocabulary) and the physiological state of the narrator (e.g. heart-rate). In this study we utilized a combination of auditory, text, and physiological signals to predict the mood (happy or sad) of 31 narrations from subjects engaged in personal story-telling. We extracted 386 audio and 222 physiological features (using the Samsung Simband) from the data. A subset of 4 audio , 1 text, and 5 physiologic features were identified using Sequential Forward Selection (SFS) for inclusion in a Neural Network (NN). These features ... Continue reading →


MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

by Do Trung Hai on 2018-02-05 11:28:34

Date : 2018. 02. 05 (Mon) 17:00 Locate : EB5. 533 Presenter : Trunghai Do Author : Andrew G. Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, Hartwig Adam  Abstract : We present a class of efficient models called MobileNets for mobile and embedded vision applications. MobileNets are based on a streamlined architecture that uses depthwise separable convolutions to build light weight deep neural networks. We introduce two simple global hyperparameters that efficiently trade off between latency and accuracy. These hyper-parameters allow the model builder to choose the right sized model for their application based on the constraints of the problem. We present extensive experiments on resource and accuracy tradeoffs and show strong performance compared to other popular models on ImageNet classification. We then demonstrate the effectiveness of MobileNets across a wide ... Continue reading →


The Performance of μ-Kernel-Based Systems

by Daeyoung Song on 2018-01-12 17:45:41

Title : The Performance of μ-Kernel-Based Systems SOSP(Symposium on Operating Systems Principles) ‘97 Author : Hermann H¨artig, Michael Hohmuth, Sebastian Sch¨onberg Jean Wolter (Department of Computer Science, Dresden University of Technology, Germany), Jochen Liedtke (IBM T.J. Watson Research Center)  Abstract : First-generation p-kernels have a reputation for being too slow and lacking sufficient flexibility. To determine whether LA, a lean second generation p-kernel, has overcome these limitations, we have repeated several earlier experiments and conducted some novel ones. Moreover, we  ported the Linux operating system to run on top of the L4 p-kernel and compared the resulting system with both Linux running native, and MkLinux, a Linux version that executes on top of a first-generation Mach derived p-kernel. For L4Linux, the AIM benchmarks report a maximum\ throughput which is only 5% lower than that of native Linux. The  ... Continue reading →


HiCH: Hierarchical Fog-Assisted Computing Architecture for Healthcare IoT

by Sihyeong Park on 2017-11-14 17:37:30

Date : 2017. 11. 20 (Mon) 17:00 Locate : EB5. 533 Presenter : Sihyeong Park   Title : HiCH: Hierarchical Fog-Assisted Computing Architecture for Healthcare IoT ACM Transactions on Embedded Computing Systems (TECS) - Special Issue ESWEEK 2017, CASES 2017, CODES + ISSS 2017 and EMSOFT 2017 TECS  Volume 16 Issue 5s, October 2017  Article No. 174    Author : Iman Azimi University of Turku, Turku, Finland, Arman Anzanpour University of Turku, Turku, Finland, Amir M. Rahmani University of California Irvine and TU Wien, Irvine, CA, USA, Tapio Pahikkala University of Turku, Turku, Finland, Marco Levorato University of California Irvine, Irvine, CA, USA, Pasi Liljeberg University of Turku, Turku, Finland, Nikil Dutt University of California Irvine, Irvine, CA, US Abstract : The Internet of Things (IoT) paradigm holds significant promises for remote health monitoring systems. Due to their life- or mission-critical nature, these ... Continue reading →