BACK TO INDEX

All publications sorted by (null)
    1. Exploring an Aspect-Oriented Approach to OS Code, 2001. 4th ECOOP Workshop on Object-Orientation and Operating Systems. [WWW ] [bibtex-key = coady-aopos]


    2. P. Jalote. Fault-Tolerance in Distributed Systems. Prentice Hall, 1994. [WWW ] [bibtex-key = Jalote94]


    3. Gänther Bauer. Transparent Fault Tolerance in a Time-Triggered Architecture. PhD thesis, Technische Universität Wien, Institut für Technische Informatik, Treitlstr. 3/3/182-1, 1040 Vienna, Austria, 2001 . [bibtex-key = bauer2001]


    4. S. S. Kulkarni. Component-based design of fault-tolerance. PhD thesis, Ohio State University, 1999. [WWW ] [bibtex-key = kulkarni-thesis]


    5. Zhiming Liu. Fault-Tolerant Programming by Transformations. PhD thesis, University of Warwick, 1991. [WWW ] [bibtex-key = thesis]


    6. Kwang Sik Chung, YoungJun Lee, HeoChang Yu, and WonGyu Lee. Management of Fault Tolerance Information for Coordinated Checkpointing Protocol without Sympathetic Rollbacks. Journal of Information Science and Engineering, 20(2):379-390, March 2004. [WWW ] [bibtex-key = CP]


    7. Chi-Yi Lin, Szu-Chi Wang, and Sy-Yen Kuo. An efficient time-based checkpointing protocol for mobile computing systems over mobile IP. Mob. Netw. Appl., 8(6):687--697, 2003. [WWW ] [bibtex-key = mobileCP]


    8. S. Kalaiselvi and V. Rajaraman. A Survey of Checkpointing Algorithms for Parallel and Distributed Computers. Sadhana, 25(5):489-510, october 2000. [WWW ] [bibtex-key = surveyCP]


    9. S. S. Kulkarni and A. Arora. Automating the Addition of Fault-Tolerance. Formal Techniques in Real-Time and Fault-Tolerant Systems, 2000. [WWW ] [bibtex-key = kulkarni-ftrtft00]


    10. Felix C. Gartner. Fundamentals of fault-tolerant distributed computing in asynchronous environments. ACM Comput. Surv., 31(1):1--26, 1999. [WWW ]
      Annotation:
      A survey on FT. Section 3, 4, 5 are noticeable, which give formal approach to FT, redundancy (introduced as must but not enough for FT).
      [bibtex-key = p1-gartner]


    11. Zhiming Liu and Mathai Joseph. Specification and verification of fault-tolerance, timing, and scheduling. ACM Trans. Program. Lang. Syst., 21(1):46--89, 1999. [bibtex-key = Liu-ft-rt]


    12. D. Manivannan and M. Singhal. Quasi-Synchronous Checkpointing: Models, Characterization, and Classification. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 10(7):703-713, 1999. [WWW ] [bibtex-key = Quasi-SyncCP]


    13. Anh Nguyen-Tuong and Andrew S. Grimshaw. Using Reflection for Incorporating Fault-Tolerance Techniques into Distributed Applications. Parallel Processing Letters, 9(2):291-301, 1999. [WWW ] [bibtex-key = nguyentuong99using]


    14. Santiago Rodriquez, Antonio Perez, and Rafael Mendez. A new checkpoint mechanism for real time operating systems. SIGOPS Oper. Syst. Rev., 31(4):55--62, 1997. [WWW ]
      Annotation:
      An approach that proposes the extension of some real time system calls in order to save a recovery point when the user invokes them. If checkpointing is frequently done, computer performance will be decreased because a great amount of temporal data will be stored. We try to reduce the performance loss in two ways: checkpoints are saved only at the end of a control cycle (the number of checkpoints is reduced) and only when a write is done (the number of system calls affected are decreased). checkpoint is done at the end of these services.
      [bibtex-key = p55-rodriguez]


    15. Avi Ziv and Jehoshua Bruck. An On-Line Algorithm for Checkpoint Placement. IEEE Trans. Comput., 46(9):976--985, 1997. [WWW ]
      Annotation:
      Explains fixed interval CP together with optimization, and then an online alg. for CP placement. Performance comparison is made.
      [bibtex-key = t0976]


    16. Francisco V. Brasileiro, Paul Devadoss Ezhilchelvan and Santosh K. Shrivastava, Neil A. Speirs, and S. Tao. Implementing Fail-Silent Nodes for Distributed Systems. IEEE Trans. Comput., 45(11):1226--1238, 1996. [WWW ] [bibtex-key = fail-silent]


    17. E. Kindler. Safety and Liveness Properties: A Survey. Bulletin of the European Association for Theoretical Computer Science, 53:268--272, 1994. [WWW ]
      Annotation:
      A very short survey explaining SAFETY and LIVENESS properties for distrbuted systems
      [bibtex-key = kindler94safety]


    18. Hermann Kopetz and Gunter Grunsteidl. TTP-A Protocol for Fault-Tolerant Real-Time Systems. Computer, 27(1):14--23, 1994. [WWW ] [bibtex-key = ttp-94]


    19. Michael Barborak, Anton Dahbura, and Minoslaw Malek. The consensus problem in fault-tolerant computing. ACM Comput. Surv., 25(2):171--220, 1993. [WWW ] [bibtex-key = p171-barborak]


    20. F. Cristian. Understanding Fault-Tolerant Distributed Systems. Communication of the ACM, 34(2):56--78, February 1991. [WWW ]
      Annotation:
      The paper proposes basic concepts which are used to explain hardware and software architecture for fault-tolerant distributed systems. For example, the concepts of server, depends on relation, failure, failure semantics. The general issues in hardware and software architectures are presented. For software fault-tolerance, the issues related to synchronisation (close ou loose) are discussed. The paper was revised in 1993, but the references are before 1990. Some examples of industrial faul-tolerant architectures are given. \availableat\ ftp://ftp.cs.ucsd.edu/pub/team/understandingftsystems.ps.Z
      [bibtex-key = cristian91]


    21. Victor P. Nelson. Fault-Tolerant Computing: Fundamental Concepts. Computer, 23(7):19--25, 1990. [WWW ] [bibtex-key = nelson90]


    22. Fred B. Schneider. Implementing fault-tolerant services using the state machine approach: a tutorial. ACM Comput. Surv., 22(4):299--319, 1990. [WWW ] [bibtex-key = sch90a]


    23. Richard Koo and Sam Toueg. Checkpointing and rollback-recovery for distributed systems. IEEE Trans. Softw. Eng., 13(1):23--31, 1987. [WWW ] [bibtex-key = p1150-koo]


    24. K. Mani Chandy and Leslie Lamport. Distributed snapshots: determining global states of distributed systems. ACM Trans. Comput. Syst., 3(1):63--75, 1985. [WWW ] [bibtex-key = p63-chandy]


    25. F. Schneider. Byzantine generals in action: implementing fail-stop processors. ACM Transactions on Computer Systems (TOCS), 2(2):145-154, 1984. [WWW ] [bibtex-key = ByzGensInAction]


    26. Richard D. Schlichting and Fred B. Schneider. Fail-Stop Processors: An Approach to Designing Fault-Tolerant Computing Systems. ACM Transactions on Computer Systems, 1(3):222-238, 1983. [WWW ] [bibtex-key = failstop]


    27. Karine Altisen, Florence Maraninchi, and David Stauch. Exploring Aspects in the Context of Reactive Systems. In Foundations of Aspect-Oriented Languages workshop, Lancaster, UK, March 2004. [WWW ] [bibtex-key = altisen2004]


    28. S. S. Kulkarni and A. Ebnenasir. Automated Synthesis of Multitolerance. In International Conference on Dependable Systems and Networks, Florence, Italy, June 2004. [WWW ] [bibtex-key = kulkarnidsn04]


    29. M. Baleani, A. Ferrari, L. Mangeruca and M. Peri, S. Pezzini, and A. Sangiovanni-Vincentelli. Fault-Tolerant Platforms for Automotive Safety-Critical Applications. In International Conference on Compilers, Architectures and Synthesis for Embedded Systems, CASES'03, San Jose, USA, November 2003. ACM. [WWW ]
      Annotation:
      The authors show that fail-silent platforms can be realized with limited area overhead and virtually no performance penalty. Five architectures are compared: (1) a single CPU with no error correcting coding techniques (ECC); (2) a lock-step dual processor architecture with ECC for bus and memory access; (3) a shared memory loosely synchronized dual processor architecture with ECC for memory access; (4) a triple modular redundant architecture with ECC for bus and memory access; (5) a shared memory dual lock-step architecture (i.e. with 4 processors) with with ECC for bus and memory access. For low range X-by-wire automotive systems, architectures (2) or (3) are the best solutions. For high range systems, architecture (5) is the best solution.
      [bibtex-key = baleani03]


    30. Andreas Gal, Wolfgang Schröder-Preikschat, and Olaf Spinczyk. On Aspect-Orientation in Distributed Real-time Dependable System. In The Seventh IEEE International Workshop on Object-oriented Real-time Dependable Systems (WORDS 2002), San Diego, CA, UAS, January 7-9 2002. [WWW ] [bibtex-key = gal-words2002]


    31. Wolfgang Schult and Andreas Polze. Aspect-Oriented Programming with C# and .NET. In 5th IEEE International Symposium on Object-oriented Real-time distributed Computing, Washington, DC, pages 241-248, May 2002. IEEE Computer Society Press. [WWW ] [bibtex-key = schult-isorc2002]


    32. Yvonne Coady, Gregor Kiczales, Mike Feeley, and Greg Smolyn. Using AspectC to Improve the Modularity of Path-Specific Customization in Operating System Code. In Joint European Software Engineering Conference (ESEC) and 9th ACM SIGSOFT International Symposium on the Foundations of Software Engineering (FSE-9), September 2001. [WWW ] [bibtex-key = coadypsc]


    33. F. Sánchez and J. Herrer. Fault Tolerance as an Aspect Using Jreplica. In Proceedings of the 8th IEEE Workshop on Future Trends of Distributed Computing Systems, pages 201, 2001. IEEE Computer Society. [WWW ] [bibtex-key = aspect-JReplica]


    34. Renaud Pawlak, Laurence Duchien, Gerard Florin, Laurent Martelli, and Lionel Seinturier. Distributed Separation of Concerns with Aspect Components. In Proceedings of the Technology of Object-Oriented Languages and Systems (TOOLS 33), pages 276, 2000. IEEE Computer Society. [WWW ] [bibtex-key = pawlak00]


    35. A. La Rosa, C. Passerone, L. Lavagno, F. Gregoretti, and A. Sangiovanni-Vincentelli. Finite State Machine Composition for Embedded Hardware/Software Trade-offs. In Proceedings of the 2nd IEEE International Workshop on Design, Test and Applications (WDTA'99), Dubrovnik, Croatia, pages 21-24, June 14-16 1999. [WWW ] [bibtex-key = RosaFSMforEmbedded]


    36. A. Arora and S. S. Kulkarni. Detectors and Correctors: A Theory of Fault-Tolerance Components. In Proceedings of the The 18th International Conference on Distributed Computing Systems, pages 436, 1998. IEEE Computer Society. [WWW ]
      Annotation:
      The paper claims that a fault-tolerant system consists of a fault-intolerant system and a set of fault-tolerance components. These FT components are introduced as Detectors (comparators, error detection codes, consistency checkers, watchdog programs, snoopers, alarms, snapshot procedures, acceptance tests, and exception conditions) and Correctors (voters, error correction codes, reset procedures, rollback recovery, rollforward recovery, constraint (re)satisfaction, exception handlers, and alternate procedures in recovery blocks). Gives formal approach to program, fault, specification FT, etc. This paper is included in kulkarni-thesis.pdf.
      [bibtex-key = detector-corrector]


    37. Marcos Kawazoe Aguilera, Wei Chen, and Sam Toueg. Heartbeat: A Timeout-Free Failure Detector for Quiescent Reliable Communication. In Proceedings of the 11th International Workshop on Distributed Algorithms, pages 126--140, 1997. Springer-Verlag. [WWW ] [bibtex-key = heartbeat]


    38. A.R. Nayak, Wen-Ben Jone, and S.R. Das. Designing general-purpose fault-tolerant distributed systems a layered approach. In International conference on Parallel and Distributed Systems, pages 360-364, 19-21 December 1994. IEEE. [WWW ] [bibtex-key = FT-dist-layered]


    39. Anurag Aggarwal and Diwaker Gupta. Failure Detectors for Distributed Systems. Technical report, Indian Institute of Technology, 2002. [WWW ] [bibtex-key = ds]


    40. Torres Wilfredo. Software Fault Tolerance: A Tutorial. Technical report, NASA Langley Technical Report Server, 2000. [WWW ] [bibtex-key = NASA-SW-FT]


    41. Micah Beck, James S. Plank, and Gerry Kingsley. Compiler-Assisted Checkpointing. Technical report, University of Tennessee, 1994. [WWW ] [bibtex-key = compilerCP]


    42. P.P. Shirvani, N. Saxena, N. Oh, S. Mitra, S.-Y. Yu, W.-J. Huang, S. Fernandez-Gomez, N.A. Touba, and E.J. McCluskey. Fault-Tolerance Projects at Stanford CRC, 1999. [WWW ] [bibtex-key = shirvani99FT]


    43. C. Tanzer, S. Poledna, E. Dilger, and T. Führer. A Fault-Tolerance Layer for Distributed Fault-Tolerant Hard Real-Time Systems. Note: In www.tttech.com/technology/docs/history/TTTech_1998-Fault-Tolerance_Layer.pdf, 1998. [WWW ] [bibtex-key = FT-layer]


    44. Curtis Clifton, Gary T. Leavens, and Mitchell Wand. Parameterized Aspect Calculus: A Core Calculus for the Direct Study of Aspect-Oriented Languages. [WWW ] [bibtex-key = calculus-Aspect]



    BACK TO INDEX




    Disclaimer:

    This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All person copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

    Les documents contenus dans ces répertoires sont rendus disponibles par les auteurs qui y ont contribué en vue d'assurer la diffusion à temps de travaux savants et techniques sur une base non-commerciale. Les droits de copie et autres droits sont gardés par les auteurs et par les détenteurs du copyright, en dépit du fait qu'ils présentent ici leurs travaux sous forme électronique. Les personnes copiant ces informations doivent adhérer aux termes et contraintes couverts par le copyright de chaque auteur. Ces travaux ne peuvent pas être rendus disponibles ailleurs sans la permission explicite du détenteur du copyright.




    Last modified: Mon Oct 25 11:18:46 2004
    Author: ayav.


    This document was translated from BibTEX by bibtex2html